GenEOL: Harnessing the Generative Power of LLMs for Training-Free Sentence Embeddings
Raghuveer Thirukovalluru, Bhuwan Dhingra (2024)
In arXiv. [ arxiv | code ]
Enhancing Large Language Models’ Situated Faithfulness to External Contexts
Yukun Huang, Sanxing Chen, Hongyi Cai, Bhuwan Dhingra (2024)
In arXiv. [ arxiv | code | data ]
Real-time Fake News from Adversarial Feedback
Sanxing Chen, Yukun Huang, Bhuwan Dhingra (2024)
In arXiv. [ arxiv ]
ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods
Roy Xie, Junlin Wang, Ruomin Huang, Minxing Zhang, Rong Ge, Jian Pei, Neil Gong, Bhuwan Dhingra (2024)
In arXiv. [ link | arxiv | code | twitter ]
Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Junlin Wang, Siddhartha Jain, Dejiao Zhang, Baishakhi Ray, Varun Kumar, Ben Athiwaratkun (2024)
In arXiv. [ arxiv ]
Mixture-of-Agents Enhances Large Language Model Capabilities
Junlin Wang, Jue Wang, Ben Athiwaratkun, Ce Zhang, James Zou (2024)
In arXiv. [ arxiv | code | twitter ]
Development and validation of VaxConcerns: A taxonomy of vaccine concerns and misinformation with Crowdsource-Viability
Rickard Stureborg, Jenna Nichols, Bhuwan Dhingra, Jun Yang, Walter Orenstein, Robert A. Bednarczyk, Lavanya Vasudevan (2024)
In Vaccine. [ link ]
Sequence Reducible Holdout Loss for Language Model Pretraining
Raghuveer Thirukovalluru, Nicholas Monath, Bhuwan Dhingra, Sam Wiseman (2024)
In COLING. [ link | code ]
Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications
Junlin Wang*, Tianyi Yang*, Roy Xie, Bhuwan Dhingra (2024)
In NAACL Findings. [ link | arxiv | code | twitter ]
SumCSE: Summary as a transformation for Contrastive Learning
Raghuveer Thirukovalluru, Xiaolan Wang, Jun Chen, Shuyang Li, Jie Lei, Rong Jin, Bhuwan Dhingra (2024)
In NAACL Findings. [ link | code ]
Tailoring Vaccine Messaging with Common-Ground Opinions
Rickard Stureborg, Sanxing Chen, Roy Xie, Aayushi Patel, Christopher Li, Chloe Qinyu Zhu, Tingnan Hu, Jun Yang, Bhuwan Dhingra (2024)
In NAACL Findings. [ link | arxiv | code | data | twitter ]
Your Large Language Models Are Leaving Fingerprints
Hope McGovern, Rickard Stureborg, Yoshi Suhara, Dimitris Alikaniotis (2024)
In arXiv. [ arxiv ]
Atomic Self-Consistency for Better Long Form Generations
Raghuveer Thirukovalluru, Yukun Huang, Bhuwan Dhingra (2024)
In arXiv. [ arxiv ]
Large Language Models are Inconsistent and Biased Evaluators
Rickard Stureborg, Dimitris Alikaniotis, Yoshi Suhara (2024)
In arXiv. [ arxiv ]
ChatShop: Interactive Information Seeking with Language Agents
Sanxing Chen, Sam Wiseman, Bhuwan Dhingra (2024)
In arXiv. [ arxiv | code ]
IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations
Deqing Fu*, Ghazal Khalighinejad*, Ollie Liu*, Bhuwan Dhingra, Dani Yogatama, Robin Jia, Willie Neiswanger (2024)
In COLM. [ link | arxiv | data | twitter ]
Extracting Polymer Nanocomposite Samples from Full-Length Documents
Ghazal Khalighinejad, Defne Circi, L. Brinson, Bhuwan Dhingra (2024)
In ACL Findings. [ arxiv | code | twitter ]
Characterizing the Confidence of Large Language Model-Based Automatic Evaluation Metrics
Rickard Stureborg, Dimitris Alikaniotis, Yoshi Suhara (2024)
In EACL. [ link ]
Adversarial Math Word Problem Generation
Roy Xie, Chengxuan Huang, Junlin Wang, Bhuwan Dhingra (2024)
In arXiv. [ arxiv | code | twitter ]
Calibrating Long-form Generations from Large Language Models
Yukun Huang, Yixin Liu, Raghuveer Thirukovalluru, Arman Cohan, Bhuwan Dhingra (2024)
In arXiv. [ arxiv | code ]
Hierarchical Multi-Label Classification of Online Vaccine Concerns
Chloe Qinyu Zhu*, Rickard Stureborg*, Bhuwan Dhingra (2024)
In AI for Health Equity and Fairness. [ link | arxiv ]
Do Not Harm Protected Groups in Debiasing Language Representation Models
Chloe Qinyu Zhu, Rickard Stureborg, Brandon Fain (2023)
In arXiv. [ arxiv ]
Exploring the Effect of Frequency Resolution in FNet
Gregory Szumel, Ghazal Khalighinejad, Rickard Stureborg, Sam Wiseman (2023)
In SustaiNLP. [ link ]
Learning the Legibility of Visual Text Perturbations
Dev Seth, Rickard Stureborg, Danish Pruthi, Bhuwan Dhingra (2023)
In EACL. [ link | arxiv ]
Interface Design for Crowdsourcing Hierarchical Multi-Label Text Annotations
Rickard Stureborg, Bhuwan Dhingra, Jun Yang (2023)
In CHI. [ link | arxiv ]