Activity
Remove exllama backend, pending further fixes
Only import big python modules for GPTQ once they get used
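A minimal sketch of the deferred-import pattern this entry describes: the heavy dependency is imported inside the load path rather than at startup, so users who never load a GPTQ model don't pay the import cost. The `gptq` module and `load_quant` function are illustrative stand-ins, not the repo's actual names.

```python
def load_gptq_model(path):
    # Deferred import: the big module is only pulled in when a GPTQ
    # model is actually loaded, keeping application startup fast.
    from gptq import load_quant  # hypothetical module/function names

    return load_quant(path)
```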
Automatically install exllama module
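The auto-install behaviour likely follows the common try-import-then-pip pattern sketched below; the package spec passed to pip is an assumption, since the actual commit may pin a specific wheel or git revision.

```python
import importlib
import subprocess
import sys

def ensure_exllama():
    # Import on demand; install into the current interpreter only if
    # the module is missing, then retry the import.
    try:
        return importlib.import_module("exllama")
    except ImportError:
        subprocess.check_call(
            [sys.executable, "-m", "pip", "install", "exllama"]  # assumed package spec
        )
        return importlib.import_module("exllama")
```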
Merge remote-tracking branch 'upstream/united' into 4bit-plugin
Fallback to transformers if hf_bleeding_edge not available
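This is the standard import-fallback idiom; assuming hf_bleeding_edge mirrors the transformers auto-class names, the swap is a one-liner:

```python
# Prefer the hf_bleeding_edge wrapper when installed, otherwise fall
# back to stock transformers (symbol name assumed to match).
try:
    from hf_bleeding_edge import AutoModelForCausalLM
except ImportError:
    from transformers import AutoModelForCausalLM
```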
Load GPTQ module from GPTQ repo docs
Merge upstream changes, fix conflict
Merge upstream changes, fix conflict, adapt backends to changes
Fix non-tuple return from gptq function
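A likely shape of the fix, sketched under the assumption that callers unpack the loader's result as a tuple while one code path returned a bare model:

```python
def unpack_model(result):
    # Normalize: accept either a bare model or a (model, extras...)
    # tuple so downstream unpacking works for both return shapes.
    if not isinstance(result, tuple):
        result = (result,)
    return result[0]
```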
Add exllama superhot positional embeddings compression support
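SuperHOT-style context extension scales rotary position indices down so an extended context (e.g. 8k tokens) maps into the model's original training range (2k). A sketch against exllama's config, with the import path and attribute names taken as assumptions that may differ across versions:

```python
from exllama.model import ExLlamaConfig  # import path varies by packaging

config = ExLlamaConfig("config.json")
config.max_seq_len = 8192
# Compress positional embeddings by 4x: 8192 / 2048 original context.
config.compress_pos_emb = 4.0
```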
Remove rocm gptq install from environments file
Disable scaled_dot_product_attention if torch version < 2
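torch.nn.functional.scaled_dot_product_attention only exists in PyTorch 2.0+, so the guard is a simple version (or attribute) check, sketched here:

```python
import torch
from packaging import version

# Gate SDPA usage on the runtime's torch version; parse() handles
# local version suffixes such as "2.0.1+cu118".
use_sdpa = version.parse(torch.__version__) >= version.parse("2.0")
# Equivalent feature test:
# use_sdpa = hasattr(torch.nn.functional, "scaled_dot_product_attention")
```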
Track token generation progress
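Progress tracking during generation typically reports tokens produced so far against the requested total. A minimal, self-contained sketch of the pattern; the callback hook is illustrative, not the repo's API:

```python
def generate(step, max_new_tokens, on_progress=None):
    tokens = []
    for i in range(max_new_tokens):
        tokens.append(step())  # produce one token (stub)
        if on_progress:
            on_progress(i + 1, max_new_tokens)
    return tokens

# Usage: prints "1/4" ... "4/4" as tokens are emitted.
generate(lambda: 0, 4, on_progress=lambda done, total: print(f"{done}/{total}"))
```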
Fix AMD ROCm exllama inference
Add v2 with bias support (e.g. for Tulu-30b)
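Some checkpoints, such as Tulu-30b, include bias terms in their linear layers, so the quantized layer must add the bias after the matmul. A simplified sketch of the idea; the real v2 layer runs a quantized kernel rather than a dense matmul:

```python
import torch

class QuantLinear(torch.nn.Module):
    # Simplified stand-in for a quantized linear layer with optional bias.
    def __init__(self, in_features, out_features, bias=True):
        super().__init__()
        self.weight = torch.nn.Parameter(torch.zeros(out_features, in_features))
        self.bias = torch.nn.Parameter(torch.zeros(out_features)) if bias else None

    def forward(self, x):
        out = x @ self.weight.t()  # dense matmul stands in for the quant kernel
        if self.bias is not None:
            out = out + self.bias
        return out
```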