Vibe Coding vs. Agentic Coding: Fundamentals and Practical Implications
of Agentic AI
Paper
• 2505.19443
• Published
• 15
Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in
LLMs
Paper
• 2506.19290
• Published
• 53
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of
Coding Tasks
Paper
• 2105.12655
• Published
StarCoder 2 and The Stack v2: The Next Generation
Paper
• 2402.19173
• Published
• 152
SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language
Models in Resolving Real-World Bugs
Paper
• 2504.14757
• Published
OctoPack: Instruction Tuning Code Large Language Models
Paper
• 2308.07124
• Published
• 32
rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale
Verified Dataset
Paper
• 2505.21297
• Published
• 29
Developer-LLM Conversations: An Empirical Study of Interactions and
Generated Code Quality
Paper
• 2509.10402
• Published
• 6
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
Paper
• 2306.08568
• Published
• 33
Magicoder: Source Code Is All You Need
Paper
• 2312.02120
• Published
• 82
Granite Code Models: A Family of Open Foundation Models for Code
Intelligence
Paper
• 2405.04324
• Published
• 25
Knowledge Transfer from High-Resource to Low-Resource Programming
Languages for Code LLMs
Paper
• 2308.09895
• Published
• 1
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Paper
• 2411.04905
• Published
• 127
OpenCodeInterpreter: Integrating Code Generation with Execution and
Refinement
Paper
• 2402.14658
• Published
• 83
Infinity Instruct: Scaling Instruction Selection and Synthesis to
Enhance Language Models
Paper
• 2506.11116
• Published
• 5
Thinking LLMs: General Instruction Following with Thought Generation
Paper
• 2410.10630
• Published
• 20
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for
Coding
Paper
• 2503.02951
• Published
• 33
SWE-QA: Can Language Models Answer Repository-level Code Questions?
Paper
• 2509.14635
• Published
• 35
CodeDPO: Aligning Code Models with Self Generated and Verified Source
Code
Paper
• 2410.05605
• Published
• 1
CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance
Paper
• 2502.04350
• Published
• 11
Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions
for Large Language Models
Paper
• 2407.21077
• Published
• 2
OpenCodeReasoning: Advancing Data Distillation for Competitive Coding
Paper
• 2504.01943
• Published
• 15
Training Long-Context, Multi-Turn Software Engineering Agents with
Reinforcement Learning
Paper
• 2508.03501
• Published
• 59
Dream-Coder 7B: An Open Diffusion Language Model for Code
Paper
• 2509.01142
• Published
PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model
Reasoning
Paper
• 2509.19894
• Published
• 34
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
Paper
• 2502.07316
• Published
• 50
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs
with Nothing
Paper
• 2406.08464
• Published
• 71
BigCodeArena: Unveiling More Reliable Human Preferences in Code
Generation via Execution
Paper
• 2510.08697
• Published
• 39
Critique-Coder: Enhancing Coder Models by Critique Reinforcement
Learning
Paper
• 2509.22824
• Published
• 21
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective
Reinforcement Learning for LLM Reasoning
Paper
• 2506.01939
• Published
• 188
X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests
Paper
• 2601.06953
• Published
• 45