Co-Learning: Code Learning for Multi-Agent reinforcement Collaborative Framework with Conversational Natural Language Interfaces

You can find our paper at this link: https://arxiv.org/abs/2409.00985

Framework Overview

The Co-Learning framework consists of five key agents:

Main Agent: Supervises and interacts with users.
Correction Agent: Revises and corrects code.
Interpretation Agent: Explains programming logic to identify incorrect code.
Test Agent: Tests the corrected code.
Annotation Agent: Adds comments to the revised code for better user understanding.

These agents communicate through conversational interfaces, and the system employs Environmental Reinforcement Learning (E-RL) to self-improve and provide feedback to both the agents and human users. The Co-Learning framework utilizes models such as ERNIE, SparkDesk, and LLaMa for different agents, and it evaluates code correction performance using criteria such as passing probability tests, single loop computation time, and the number of required loops.

Key Contributions

Development of a Multi-Agent framework using multiple LLMs for code error correction.
Evaluation of LLM performance using an original dataset containing 702 error codes.
Exploration of reinforcement learning in a multi-agent environment based on large language models.
Benchmarking against existing frameworks, demonstrating significant improvements in accuracy and operating speed.

Workflow

The Co-Learning framework operates within an environment created by the Main Agent. The workflow is as follows:

The Correction Agent uses a default LLM to make an initial correction to the input error code.
The Test Agent then evaluates the corrected code using test samples. If the code passes all tests, it is sent to the Annotation Agent for final annotation and output.
If the code fails any test, it is passed to the Interpretation Agent for further analysis. This process is stored in memory as an E-RL prompt.
The system then selects the appropriate Correction Agent using reinforcement learning and generates a new code based on the memorized data and interpretation, forming a loop until the code passes all tests.

The system is designed to enhance code error correction by mimicking human debugging processes and adapting in real-time to select the most appropriate LLM model based on E-RL.

Dataset

We hope that this dataset (we create) can serve as a valuable resource for further research and development in the field of code error correction, enabling the creation of more robust and efficient automated programming tools. You can find our dataset in the dataset folder which includes 702 error code samples used for evaluation.

Subset of the Dataset Samples

Error Code	Test List	Challenge Test List
`def remove_Occ(string, character):`	`[assert remove_Occ("hello","l") == "heo",`	`[assert remove_Occ("hellololl","l") == "heolol",`
	`assert remove_Occ("abcda","a") == "bcd",`	`assert remove_Occ("","l") == "",`
	`assert remove_Occ("PHP","P") == "H"]`	`assert remove_Occ("","l") == ""]`
`def is_woodall(number):`	`[assert is_woodall(383) == True,`	`[assert is_woodall(32212254719) == True,`
	`assert is_woodall(254) == False,`	`assert is_woodall(32212254718) == False,`
	`assert is_woodall(200) == False]`	`assert is_woodall(159) == True]`

API Key Acquisition

To utilize the APIs required for this project, you need to obtain the necessary API keys. Follow the instructions below to get started:

1. Baidu/ Llama API Key

Step 1: Visit the Baidu AI Open Platform.
Step 2: Log in or sign up for a Baidu account.
Step 3: After logging in, navigate to the "Console" (控制台) to create a new application.
Step 4: Under "Application Management" (应用管理), select "Create Application" (创建应用).
Step 5: Choose the services you need (such as NLP, OCR, etc.) and then you will be provided with an API Key and Secret Key. You can also get Llama key in this platform

You can access the platform and start the process here: Baidu AI Open Platform.

2. Spark API Key

Step 1: Visit the iFLYTEK Open Platform.
Step 2: Log in or sign up for an iFLYTEK account.
Step 3: Once logged in, navigate to the "Console" (控制台).
Step 4: Under "My Applications" (我的应用), click on "Create Application" (创建应用).
Step 5: After creating the application, you will be able to access the API Key, Secret Key, and App ID.

Start the process to obtain your Spark API key here: iFLYTEK Open Platform.

Getting Started

To get started with the Co-Learning framework, you can clone this repository and follow the setup instructions provided.

git clone https://github.com/yuqian2003/Co_Learning.git
cd Co_Learning

python Co_learning.py

Main Results

Co-Learning with Different LLM Performance Comparison

Method	1 loop	2 loops	3 loops	4 loops	5 loops	Average running time (s)	Accuracy (%)
Co-Learning (ERNIE 4.0)	337	60	31	29	245	137.5	65.09
Co-Learning (Llama 3-8b)	317	81	32	21	251	112.8	64.24
Co-Learning (Spark V3)	319	48	14	4	317	57.7	54.84
Co-Learning (E-RL)	280	104	65	27	226	99.8	67.80

Case Study

Citation

If you use this code in your research, please cite the following paper:

@misc{yu2024colearningcodelearningmultiagent,
      title={Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces}, 
      author={Jiapeng Yu and Yuqian Wu and Yajing Zhan and Wenhao Guo and Zhou Xu and Raymond Lee},
      year={2024},
      eprint={2409.00985},
      archivePrefix={arXiv},
      primaryClass={cs.SE},
      url={https://arxiv.org/abs/2409.00985}, 
}

Acknowledgments:

This research was supported by Beijing Normal University-Hong Kong Baptist University United International College (UIC) and the IRADs lab. The authors express their gratitude for providing the necessary computational facilities.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
Dataset		Dataset
Log		Log
history		history
images		images
utils		utils
.gitattributes		.gitattributes
Co_learning.py		Co_learning.py
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Co-Learning: Code Learning for Multi-Agent reinforcement Collaborative Framework with Conversational Natural Language Interfaces

Framework Overview

Key Contributions

Workflow

Dataset

Subset of the Dataset Samples

API Key Acquisition

1. Baidu/ Llama API Key

2. Spark API Key

Getting Started

Main Results

Co-Learning with Different LLM Performance Comparison

Case Study

Citation

Acknowledgments:

About

Releases

Packages

Languages

License

yuqian2003/Co_Learning

Folders and files

Latest commit

History

Repository files navigation

Co-Learning: Code Learning for Multi-Agent reinforcement Collaborative Framework with Conversational Natural Language Interfaces

Framework Overview

Key Contributions

Workflow

Dataset

Subset of the Dataset Samples

API Key Acquisition

1. Baidu/ Llama API Key

2. Spark API Key

Getting Started

Main Results

Co-Learning with Different LLM Performance Comparison

Case Study

Citation

Acknowledgments:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages