Customer-obsessed science
Research areas
-
November 20, 20254 min readA new evaluation pipeline called FiSCo uncovers hidden biases and offers an assessment framework that evolves alongside language models.
-
-
-
September 2, 20253 min read
-
Featured news
-
2024Training a supervised news summarization model requires large amounts of high-quality training data consisting of news articles paired with reference summaries. However, obtaining such data is costly, and existing datasets contain considerable amount of noise. We present a new large-scale and high-quality dataset for supervised abstractive news summarization containing 1.3 million training samples, which
-
ESWC 20242024We present an approach to represent composite values (lists and maps, in particular) as literals in RDF data, and to extend SPARQL with features related to such literals. These extensions include an aggregation function to produce these composite values, functions to operate on these composite values in expressions, and a new operator to unfold such composite values into their individual components. As
-
SIOP 20242024Correlating assessment scores with performance in role (PIR) metrics provides a powerful form of validation evidence, but is complicated by the absence of PIR metrics for applicants who were not hired. Traditional range restriction perspectives state that the problem is a lack of PIR metrics for low assessment scores, and typical corrections make strong assumptions about how the relationship among incumbents
-
PQC Standardization Conference 20242024The QC-MDPC code-based KEM BIKE is an alternative candidate for standardization for the NIST Post-Quantum Cryptography Standardization Project. Per NIST’s report [2] “The BIKE cryptosystem was initially designed for ephemeral key use but has now been claimed to also support static key use”. BIKE uses the BGF decoder of [9] where its Decoding Failure Rate (DFR) is estimated by means of an extrapolation method
-
ICLR 2024 Workshop on Secure and Trustworthy Large Language Models (SET LLM)2024In this work, we propose sequence-level certainty as a common theme over hallucination in Knowledge Grounded Dialogue Generation (KGDG). We explore the correlation between the level of hallucination in model responses and two types of sequence-level certainty: probabilistic certainty and semantic certainty. Empirical results reveal that higher levels of both types of certainty in model responses are correlated
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all