publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2025
-
preprintEfficient Many-Shot In-Context Learning with Dynamic Block-Sparse AttentionIn [under submission], 2025
-
preprintNot-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Model Design DecisionsIn [under submission], 2025
-
ICLRBetter Instruction-Following Through Minimum Bayes RiskIn International Conference on Learning Representations (ICLR), 2025
-
NAACLIn-context learning with long-context models: An in-depth explorationIn 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025
2024
-
CONDAA Taxonomy for Data Contamination in Large Language ModelsIn The 1st Workshop on Data Contamination (CONDA), 2024
-
TMLRFrom Decoding to Meta-Generation: Inference-time Algorithms for Large Language ModelsIn Transactions on Machine Learning Research, 2024
2023
-
EMNLPTo Build Our Future, We Must Know Our Past: Contextualizing Paradigm Shifts in Natural Language ProcessingIn Empirical Methods in Natural Language Processing., 2023
-
Big PictureIt’s MBR All the Way Down: Modern Generation Techniques Through the Lens of Minimum Bayes RiskIn Proceedings of the First Big Picture Workshop., 2023
-
NeurIPSUnlimiformer: Long-Range Transformers with Unlimited Length InputIn Conference on Neural Information Processing Systems., 2023
-
TACLBridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language GenerationIn Transactions of the Association of Computational Linguistics., 2023
-
EMNLP DemoPrompt2Model: Generating Deployable Models from Natural Language InstructionsIn Empirical Methods in Natural Language Processing: Demo Track., 2023
-
PreprintLLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMsIn arXiv., 2023