Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization Paper • 2206.04007 • Published Jun 8, 2022
Understanding Memorisation in LLMs: Dynamics, Influencing Factors, and Implications Paper • 2407.19262 • Published Jul 27, 2024
Revisiting Privacy, Utility, and Efficiency Trade-offs when Fine-Tuning Large Language Models Paper • 2502.13313 • Published Feb 9
Overview of the HASOC Subtrack at FIRE 2023: Identification of Tokens Contributing to Explicit Hate in English by Span Detection Paper • 2311.09834 • Published Nov 16, 2023
The Art of Embedding Fusion: Optimizing Hate Speech Detection Paper • 2306.14939 • Published Oct 8, 2023
Beyond Negativity: Re-Analysis and Follow-Up Experiments on Hope Speech Detection Paper • 2306.01742 • Published May 10, 2023
Fine-tuning vs. In-context Learning in Large Language Models: A Formal Language Learning Perspective Paper • 2604.23267 • Published 17 days ago
Fractional Rotation, Full Potential? Investigating Performance and Convergence of Partial RoPE Paper • 2603.11611 • Published Mar 12
TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability Paper • 2507.19419 • Published Sep 30, 2025
Towards Reliable Latent Knowledge Estimation in LLMs: In-Context Learning vs. Prompting Based Factual Knowledge Extraction Paper • 2404.12957 • Published Apr 19, 2024
QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs Paper • 2412.11763 • Published Dec 16, 2024
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon Paper • 2406.17746 • Published Jun 25, 2024
Rote Learning Considered Useful: Generalizing over Memorized Data in LLMs Paper • 2507.21914 • Published Jul 29, 2025
Hubble: a Model Suite to Advance the Study of LLM Memorization Paper • 2510.19811 • Published Oct 22, 2025
In Agents We Trust, but Who Do Agents Trust? Latent Source Preferences Steer LLM Generations Paper • 2602.15456 • Published Feb 17
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling Paper • 2304.01373 • Published Apr 3, 2023 • 9
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative_185B Text Generation • 1B • Updated 15 days ago • 41
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative_180B_Harvard_5B Text Generation • 1B • Updated 15 days ago • 22
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative_180B_Harvard_5B Text Generation • 1B • Updated 15 days ago • 22
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative_185B Text Generation • 1B • Updated 15 days ago • 41