This is an experiment built on a fork of smol-gpt that trains a GPT to generate text by predicting the 'previous word/token' instead of the usual 'next word/token'.
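The repo doesn't spell out the training mechanics here, but one simple way to get previous-token prediction out of a standard left-to-right transformer is to reverse each training sequence, so ordinary next-token training on the reversed text is equivalent to previous-token prediction on the original. Below is a minimal, hypothetical sketch of that idea; the function name `make_backward_batch` and the tensor shapes are illustrative assumptions, not smol-gpt's actual API.

```python
import torch

def make_backward_batch(tokens: torch.Tensor):
    """Build (input, target) pairs for previous-token prediction.

    Hypothetical sketch: in ordinary GPT training the target is the
    sequence shifted left by one. Here we first reverse each sequence,
    so predicting the "next" token of the reversed text is the same as
    predicting the token *before* each position in the original text.
    """
    rev = torch.flip(tokens, dims=[-1])  # reverse each sequence in the batch
    x = rev[..., :-1]                    # model input (original text, backwards)
    y = rev[..., 1:]                     # "next" token of reversed text
    return x, y                          # ... i.e. the previous original token

# Example with the toy sequence [A, B, C, D] encoded as ids 0..3:
x, y = make_backward_batch(torch.tensor([[0, 1, 2, 3]]))
print(x)  # tensor([[3, 2, 1]])  -> model sees D, C, B
print(y)  # tensor([[2, 1, 0]])  -> and must predict C, B, A
```

At generation time the same trick runs in reverse: the model extends the reversed sequence token by token, and flipping the result back gives text that grows leftwards from the prompt.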
We have trained two proof-of-concept versions of the model: one on TinyStories, and one on a small subset of Hugging Face's FineWeb, fine-tuned on Databricks' Dolly dataset.
We are currently training a larger version of the model (the base model BackGPT and an instruction-tuned version, BackChat). We will update this repo with information on using the full model as soon as it is ready to download and use (estimated early summer 2025).
It will later be made available on our website: https://chat.thanks.fish