Skip to content

v0.1.6

Compare
Choose a tag to compare
@guinmoon guinmoon released this 29 Jun 18:43
· 295 commits to main since this release

Changelog

  • Add gpt2 inference
  • Add replit inference
  • Add support for k_quants
  • Add chat reload button
  • Add gpt_neox updated
  • Fixed custom prompt format
    for example for ORCA its: ### User:\n{{prompt}}\n\n### Response:\n
  • Fixed RedPajma
  • Fixed context params on load
  • Fixed memory leak on reload model
  • Fixed autoscroll in message view