Activity

Merge pull request #74 from Krisseck/werkzeug2

Pull request merge
0cc4m pushed 5 commits to latestgptq • e1127b9…1c0efaa
on Oct 12, 2023

Merge pull request #70 from pi6am/feat/exllama-unban-eos

Pull request merge
pi6am pushed 2 commits to exllama • 36f53cc…fe53cb2
on Aug 30, 2023

Merge pull request #69 from pi6am/merge/united-exllama

Pull request merge
pi6am pushed 58 commits to exllama • 6e64763…36f53cc
on Aug 30, 2023

Merge pull request #68 from pi6am/merge/united-exllama

Pull request merge
pi6am pushed 142 commits to exllama • 5229987…6e64763
on Aug 28, 2023

Merge pull request #66 from pi6am/feat/exllama-config

Pull request merge
pi6am pushed 2 commits to exllama • 812df5e…5229987
on Aug 28, 2023

Merge pull request #65 from pi6am/feat/exllama-badwords

Pull request merge
pi6am pushed 2 commits to exllama • 0d150e4…812df5e
on Aug 28, 2023

Merge pull request #64 from pi6am/fix/multinomial-workaround

Pull request merge
pi6am pushed 2 commits to exllama • b1895de…0d150e4
on Aug 27, 2023

Merge pull request #63 from pi6am/feat/exllama-stoppers

Pull request merge
pi6am pushed 2 commits to exllama • 22fd499…b1895de
on Aug 23, 2023

Merge pull request #62 from pi6am/fix/exllama-eos-space

Pull request merge
0cc4m pushed 2 commits to exllama • 973aea1…22fd499
on Aug 22, 2023

Deleted branch

0cc4m deleted 4bit-plugin
on Jul 25, 2023

Remove exllama backend, pending further fixes

0cc4m pushed 1 commit to 4bit-plugin • 973aea1…7395306
on Jul 23, 2023

Only import big python modules for GPTQ once they get used

0cc4m created exllama • 973aea1
on Jul 23, 2023

Only import big python modules for GPTQ once they get used

0cc4m pushed 1 commit to 4bit-plugin • 49740aa…973aea1
on Jul 23, 2023

Fix ntk alpha

0cc4m pushed 1 commit to 4bit-plugin • 31a984a…49740aa
on Jul 23, 2023

Automatically install exllama module

0cc4m pushed 1 commit to 4bit-plugin • a9aa04f…31a984a
on Jul 23, 2023

Merge remote-tracking branch 'upstream/united' into 4bit-plugin

0cc4m pushed 14 commits to 4bit-plugin • 09bb102…a9aa04f
on Jul 23, 2023

Fallback to transformers if hf_bleeding_edge not available

0cc4m pushed 2 commits to 4bit-plugin • 58908ab…09bb102
on Jul 23, 2023

Revert aiserver.py changes

0cc4m pushed 1 commit to 4bit-plugin • 19f511d…58908ab
on Jul 19, 2023

Load GPTQ module from GPTQ repo docs

0cc4m pushed 2 commits to 4bit-plugin • 7516ecf…19f511d
on Jul 19, 2023

Merge upstream changes, fix conflict

0cc4m pushed 29 commits to 4bit-plugin • 9aa6c5f…7516ecf
on Jul 19, 2023

Merge upstream changes, fix conflict, adapt backends to changes

0cc4m pushed 125 commits to 4bit-plugin • 0e4b657…9aa6c5f
on Jul 19, 2023

Update README.md

0cc4m pushed 1 commit to latestgptq • 3560d09…e1127b9
on Jul 5, 2023

Fix non-tuple return from gptq function

0cc4m pushed 1 commit to 4bit-plugin • c753671…0e4b657
on Jun 28, 2023

Add exllama superhot positional embeddings compression support

0cc4m pushed 1 commit to 4bit-plugin • adad816…c753671
on Jun 27, 2023

Remove rocm gptq install from environments file

0cc4m pushed 1 commit to latestgptq • 7f5c48a…3560d09
on Jun 21, 2023

Remove rocm gptq install from environments file

0cc4m pushed 1 commit to 4bit-plugin • e8741a1…adad816
on Jun 21, 2023

Disable scaled_dot_product_attention if torch version < 2

0cc4m pushed 1 commit to 4bit-plugin • a191855…e8741a1
on Jun 20, 2023

Track token generation progress

0cc4m pushed 2 commits to 4bit-plugin • 0c7eaef…a191855
on Jun 19, 2023

Fix AMD ROCm exllama inference

0cc4m pushed 1 commit to 4bit-plugin • ebf7e2c…0c7eaef
on Jun 13, 2023

Add v2 with bias support (e.g. for Tulu-30b)

0cc4m pushed 2 commits to latestgptq • 8e4d79a…7f5c48a
on Jun 12, 2023