Python Code and Test Generation Datasets

This folder contains two notebooks.

The first, HumanEval_and_MBPP_code_gen.ipynb, downloads the HumanEval and MBPP datasets used by Microsoft CodeT for tuning a model to generate Python code from function docstrings, converts the data into prompt and solution pairs, and writes them to .jsonl files.

The second, HumanEval_and_MBPP_test_gen.ipynb, downloads the data used by Microsoft CodeT for tuning a model to generate Python tests from the corresponding function docstrings, converts the data into prompt and solution pairs, and writes them to .jsonl files.
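
The conversion both notebooks perform can be sketched roughly as follows. This is a minimal illustration, not the notebooks' actual code: it assumes each downloaded record follows the HumanEval field layout (`prompt`, `canonical_solution`, `test`), and the helper names, the sample record, and the output keys `prompt` and `solution` are hypothetical.

```python
import json
from pathlib import Path

# Hypothetical record in the HumanEval field layout (assumption: the
# downloaded data uses these keys; adjust them for MBPP-style records).
SAMPLE_RECORD = {
    "task_id": "demo/0",
    "prompt": 'def add(a, b):\n    """Return the sum of a and b."""\n',
    "canonical_solution": "    return a + b\n",
    "test": "def check(candidate):\n    assert candidate(1, 2) == 3\n",
    "entry_point": "add",
}


def to_code_gen_pair(record):
    """Pair the signature and docstring with the function body."""
    return {"prompt": record["prompt"], "solution": record["canonical_solution"]}


def to_test_gen_pair(record):
    """Pair the same signature and docstring with the test code instead."""
    return {"prompt": record["prompt"], "solution": record["test"]}


def write_jsonl(pairs, path):
    """Write one JSON object per line, which is the .jsonl convention."""
    with Path(path).open("w", encoding="utf-8") as handle:
        for pair in pairs:
            handle.write(json.dumps(pair) + "\n")
```

Calling `write_jsonl([to_code_gen_pair(r) for r in records], "code_gen.jsonl")` on a list of such records would produce the code generation file; swapping in `to_test_gen_pair` would produce the test generation file. The file names here are placeholders.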

All datasets are then uploaded to the Hugging Face Hub. The code generation data and the test generation data are uploaded separately.

Requirements

Both notebooks require the requests library.
