Skip to content

itshuey/tool-acquisition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Improving Tool Acquisition in Large Language Models

License: MIT Hugging Face Transformers

Supporting material for the thesis "Thinking Forwards, Backwards, and in Code: Improving Tool Acquisition in Large Language Models"

Authors: Huey Sun

Model Performance

Repository Structure

  • Results: The ToolSandbox evaluations results can be found here. Each result has a summary at the end, which was used to create the tables in the thesis.

  • Data: The data used to finetune the models can be found here. As Mistral's lack of chat formatting means that incremental masking is impossible (the output is not perfectly autoregressive at each conversation turn), each assistant message has been individually preformatted.

  • Scripts: The scripts used to convert json conversations to Mistral's expected formatting can be found here.

Other Code

You can find my fork of the torchtune library here, where I add support for tool chat and Mistral formatting in training.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages