BanglaForge: LLM Collaboration with Self-Refinement for Bangla Code Generation

Mahir Labib Dihan, Sadif Ahmed, Md Nafiu Rahman

Dec 22, 2025

Software Engineering, Computation and Language

Abstract

Bangla is a low-resource language for code generation: it lacks large-scale annotated datasets and tools for transforming natural language specifications into executable programs. This makes Bangla-to-code generation a challenging task. To address this, we introduce BanglaForge, a novel framework for generating code from Bangla function descriptions. BanglaForge uses a retrieval-augmented, dual-model collaboration paradigm that combines in-context learning, LLM-based translation, systematic prompt engineering, and iterative self-refinement based on execution feedback. A coder model generates initial solutions and a reviewer model enhances them for robustness. On the BLP-2025 Bangla Code Generation benchmark, BanglaForge achieves a competitive Pass@1 accuracy of 84.00%, demonstrating the effectiveness of retrieval, model collaboration, and self-refinement for low-resource Bangla code generation.
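The coder/reviewer loop with execution feedback described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the names `run_tests`, `refine`, `pass_at_1`, `toy_coder`, and `toy_reviewer` are hypothetical, the two models are replaced by hard-coded stubs, and the retrieval and translation stages are omitted. It only shows the control flow of generating a solution, executing it against tests, and passing any error back to a reviewer.

```python
from typing import Callable, List, Optional, Tuple


def run_tests(code: str, tests: List[str]) -> Optional[str]:
    """Run candidate code and assert-style tests; return an error string or None."""
    namespace: dict = {}
    try:
        exec(code, namespace)          # define the candidate function(s)
        for test in tests:
            exec(test, namespace)      # each test is a plain assert statement
    except Exception as exc:
        return f"{type(exc).__name__}: {exc}"
    return None


def refine(
    description: str,
    tests: List[str],
    coder: Callable[[str], str],
    reviewer: Callable[[str, str, str], str],
    max_rounds: int = 3,               # assumed budget, not taken from the paper
) -> Tuple[str, bool]:
    """Generate code with the coder, then let the reviewer repair it on failure."""
    code = coder(description)
    for _ in range(max_rounds):
        error = run_tests(code, tests)
        if error is None:
            return code, True
        # Execution feedback: the reviewer sees the task, the code, and the error.
        code = reviewer(description, code, error)
    return code, run_tests(code, tests) is None


def pass_at_1(solved: List[bool]) -> float:
    """Fraction of tasks whose single submitted solution passes all tests."""
    return sum(solved) / len(solved)


# Hard-coded stand-ins for the two models (purely illustrative).
def toy_coder(description: str) -> str:
    return "def add(a, b):\n    return a - b\n"   # deliberately buggy first draft


def toy_reviewer(description: str, code: str, error: str) -> str:
    return "def add(a, b):\n    return a + b\n"   # repaired version


if __name__ == "__main__":
    final_code, ok = refine(
        "Return the sum of two numbers.",
        ["assert add(2, 3) == 5"],
        toy_coder,
        toy_reviewer,
    )
    print(ok)
    print(final_code)
```

In a real system the stubs would be replaced by calls to the coder and reviewer models, and the test execution would run in a sandbox rather than through `exec` in the host process.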

Cite This Paper

arXiv:2512.19122

Year: 2025

Category: cs.SE

APA

Dihan, M. L., Ahmed, S., & Rahman, M. N. (2025). BanglaForge: LLM Collaboration with Self-Refinement for Bangla Code Generation. arXiv preprint arXiv:2512.19122.

MLA

Mahir Labib Dihan, Sadif Ahmed, and Md Nafiu Rahman. "BanglaForge: LLM Collaboration with Self-Refinement for Bangla Code Generation." arXiv preprint arXiv:2512.19122 (2025).