faster-whisper integration ? #6816

Closed
ras0k opened this issue Apr 11, 2023 · 119 comments

@ras0k commented Apr 11, 2023:

I did not read the whole thread about Whisper on GPU, but could we avoid a lot of problems with VRAM and speed by switching to faster-whisper, maybe?

@Purfview (Contributor):

How does faster-whisper's speed [on GPU] compare to whisper-ConstMe?

@Purfview (Contributor):

I asked about whisper-ConstMe, not "openai/whisper".
Btw, I find the large model's timestamps way less accurate than medium's, while the transcription is no better.

whisper-ConstMe can use any model.

@ras0k (Author) commented Apr 11, 2023:

Full benchmarks:

faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models.

This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.

Benchmark
For reference, here's the time and memory usage that are required to transcribe 13 minutes of audio using different implementations:

openai/whisper@6dea21fd
whisper.cpp@3b010f9
faster-whisper@cce6b53e

Large-v2 model on GPU

| Implementation | Precision | Beam size | Time | Max. GPU memory | Max. CPU memory |
| --- | --- | --- | --- | --- | --- |
| openai/whisper | fp16 | 5 | 4m30s | 11325MB | 9439MB |
| faster-whisper | fp16 | 5 | 54s | 4755MB | 3244MB |
| faster-whisper | int8 | 5 | 59s | 3091MB | 3117MB |

Executed with CUDA 11.7.1 on a NVIDIA Tesla V100S.

Small model on CPU

| Implementation | Precision | Beam size | Time | Max. memory |
| --- | --- | --- | --- | --- |
| openai/whisper | fp32 | 5 | 10m31s | 3101MB |
| whisper.cpp | fp32 | 5 | 17m42s | 1581MB |
| whisper.cpp | fp16 | 5 | 12m39s | 873MB |
| faster-whisper | fp32 | 5 | 2m44s | 1675MB |
| faster-whisper | int8 | 5 | 2m04s | 995MB |

Executed with 8 threads on an Intel(R) Xeon(R) Gold 6226R.
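For anyone who wants to reproduce these numbers, the faster-whisper Python API the benchmark exercises looks roughly like this (a minimal sketch based on the faster-whisper README; the model name, device and precision mirror the table rows above, and the audio file name is only a placeholder):

```python
# Minimal faster-whisper usage sketch; assumes `pip install faster-whisper`.
from faster_whisper import WhisperModel

# fp16 on GPU, as in the "Large-v2 model on GPU" rows above.
model = WhisperModel("large-v2", device="cuda", compute_type="float16")

# transcribe() returns a lazy generator of segments plus detection info.
segments, info = model.transcribe("audio.mp3", beam_size=5)
print(f"Detected language: {info.language} (p={info.language_probability:.2f})")
for segment in segments:
    print(f"[{segment.start:.2f}s -> {segment.end:.2f}s] {segment.text}")
```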

@Purfview (Contributor):

> how much faster is ConstMe compared to openai/whisper?

If it ran for me, I wouldn't ask.
Why are you posting these pointless posts?

@ras0k (Author) commented Apr 11, 2023:

> Why are you posting these pointless posts?

Because this would help the software? Why would we not integrate faster-whisper? Why are you so adversarial to contributions to an open-source project?

@Purfview (Contributor):

> Because this would help the software? Why would we not integrate faster-whisper? Why are you so adversarial to contributions to an open-source project?

I asked a question, you answered with some irrelevant posts. I'm adversarial to nonsense...

@ras0k (Author) commented Apr 11, 2023:

> I asked a question, you answered with some irrelevant posts. I'm adversarial to nonsense...

I think const-me/whisper is a Windows port of the whisper.cpp implementation,
which in turn is a C++ port of OpenAI's Whisper automatic speech recognition (ASR) model.

@Purfview (Contributor):

> I think const-me/whisper is a Windows port of the whisper.cpp implementation, which in turn is a C++ port of OpenAI's Whisper automatic speech recognition (ASR) model.

Are you a GPT-3 bot tuned on 4chan?

@ras0k (Author) commented Apr 11, 2023:

> How does faster-whisper's speed [on GPU] compare to whisper-ConstMe?

At least 5x faster on CPU and 10x faster if you use GPU.

@Purfview (Contributor):

> At least 5x faster on CPU and 10x faster if you use GPU.

So, a few minutes ago you didn't know what whisper-ConstMe is, and now you are posting "benchmarks" out of your ass...

> how was my post irrelevant? it's a port of whisper.cpp and the benchmark is testing whisper.cpp

If you are not a bot, then clearly one with some mental deficiency.

@Purfview (Contributor):

> the benchmarks are from the repo and i am autistic.

I see... Take ten deep breaths; no need to type more posts.
I'm unsubscribing from this thread.

@rsmith02ct:

I'm not sure why this post devolved into insults instead of mutual understanding.

whisper-ConstMe is a GPU-enabled implementation of Whisper. Does faster-whisper provide any benefits in terms of speed or accuracy or GPU ram usage compared to it? ConstMe is already integrated into SubtitleEdit which is why the question is relevant.

@ras0k (Author) commented Apr 12, 2023:

> I'm not sure why this post devolved into insults instead of mutual understanding.
>
> whisper-ConstMe is a GPU-enabled implementation of Whisper. Does faster-whisper provide any benefits in terms of speed or accuracy or GPU ram usage compared to it? ConstMe is already integrated into SubtitleEdit which is why the question is relevant.

Yes, it does go about 5x faster with the optimizations they provide; that is what the benchmarks I posted show, and they are on the faster-whisper GitHub. You can also use whisper-ctranslate2 directly.

@ras0k (Author) commented Apr 12, 2023:

> or GPU ram usage compared to it?

We also save a lot of VRAM, which means we can run large-v2 on 4 GB GPUs.

@ras0k (Author) commented Apr 12, 2023:

> Btw, I find the large model's timestamps way less accurate than medium's, while the transcription is no better.

Maybe for English medium is fine, but for multilingual use large-v2 is a lot more useful than having to download a specific model for each language.

@rsmith02ct commented Apr 12, 2023 via email

@ras0k (Author) commented Apr 12, 2023:

> Const-Me also has huge speed boosts over CPU-only implementations. I'll assume ConstMe and Faster Whisper are comparable unless someone reports data to the contrary.

Can you provide a benchmark that shows this?

@ras0k (Author) commented Apr 12, 2023:

I am talking about 5x speed GPU vs. GPU, not CPU vs. GPU.

@rsmith02ct commented Apr 12, 2023 via email

@ras0k (Author) commented Apr 12, 2023:

I understand and respect your desire not to ponder on this, but for me a tiny benchmark is completely useless. I am only talking about comparing whisper (in GPU mode) and faster-whisper (also GPU mode) on large-v2, because I believe there will be a use case for a lot of users. I will do my best to provide the benchmarks you asked for soon.

@ras0k (Author) commented Apr 12, 2023:

> Larger models may show more difference but my GPU only has 4GB ram.

You can already try large-v2 on faster-whisper with your GPU; is that not incentive enough to want it?

@rsmith02ct commented Apr 12, 2023 via email

@ras0k (Author) commented Apr 12, 2023:

> How much VRAM do you need for the large v2 model in Faster Whisper? That may limit its interest to users.

3.09 GB

https://huggingface.co/guillaumekln/faster-whisper-large-v2

@ras0k (Author) commented Apr 12, 2023:

> you haven't provided any test data to show how it is better than the current options.

faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models.

This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.

I posted the full benchmarks above in my first reply, but I will also try Subtitle Edit and post my results soon.

@rsmith02ct commented Apr 12, 2023 via email

@ras0k (Author) commented Apr 12, 2023:

> Is that the needed VRAM or the model size? I can't run the medium model (1.5gb) on my 4GB GPU FWIW.

Oh sorry, yes, that is the model size. I am not sure about VRAM use; I will try right now, but if you take the time to read the benchmarks I posted, they say 4.8 GB or 3.1 GB depending on fp16 or int8.

Large-v2 model on GPU

| Implementation | Precision | Beam size | Time | Max. GPU memory | Max. CPU memory |
| --- | --- | --- | --- | --- | --- |
| openai/whisper | fp16 | 5 | 4m30s | 11325MB | 9439MB |
| faster-whisper | fp16 | 5 | 54s | 4755MB | 3244MB |
| faster-whisper | int8 | 5 | 59s | 3091MB | 3117MB |

Executed with CUDA 11.7.1 on a NVIDIA Tesla V100S.
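To make the fp16 vs int8 numbers concrete, this is roughly how the precision is chosen when loading the model (a sketch under the assumption that faster-whisper's compute_type argument is what the benchmark varied; the VRAM threshold and file name are illustrative only):

```python
from faster_whisper import WhisperModel

# From the table above: int8 peaks around 3.1 GB of GPU memory vs ~4.8 GB for
# fp16, which is what makes large-v2 feasible on a 4 GB card.
LOW_VRAM_GB = 4
compute_type = "int8" if LOW_VRAM_GB <= 4 else "float16"

model = WhisperModel("large-v2", device="cuda", compute_type=compute_type)
segments, _info = model.transcribe("audio.wav", beam_size=5)
for segment in segments:
    print(segment.text)
```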

@ras0k (Author) commented Apr 12, 2023:

> Is that the needed VRAM or the model size? I can't run the medium model (1.5gb) on my 4GB GPU FWIW.

Just FYI, right now I am testing the large model on ConstMe and it uses about 4.2 GB, so medium should run on your 4 GB GPU; medium shows about 2.3 GB usage max.

@ras0k (Author) commented Apr 12, 2023:

my GPU is a 2060 6GB

@ras0k (Author) commented Apr 12, 2023:

For English, medium.en is fine, but for French not even large works, so I really need large-v2 for its multilingual capabilities.

@ras0k (Author) commented Apr 12, 2023:

Do you want me to compare the speed of ConstMe vs faster-whisper on large, just for benchmarking purposes?

@Purfview (Contributor):

> 1:30 min transcription about 4 minutes vs 7 or so for Const-me using large v2 for both. I tested it through SubtitleEdit beta, not from the command line.
>
> Edit: and Japanese works!

Nice! I would be interested in tests on a longer sample and the medium model.
Btw, do you find the results from the large-v2 model valuably better in comparison to medium?

@Purfview (Contributor):

Btw, by default it sets threads to the max number of real cores; that's probably not healthy on CPUs with a lot of cores.
Can someone with such a CPU run the tests and report the optimal threads setting?
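For reference, the standalone's --threads option appears to correspond to faster-whisper's cpu_threads argument; a rough sketch for timing a few thread counts on CPU (the audio file and the timing loop are illustrative, not taken from this thread):

```python
import time
from faster_whisper import WhisperModel

def time_cpu_run(threads: int, audio: str = "sample.wav") -> float:
    """Time one CPU transcription with a fixed number of threads."""
    model = WhisperModel("medium", device="cpu", cpu_threads=threads)
    start = time.perf_counter()
    segments, _info = model.transcribe(audio, beam_size=5)
    for _ in segments:  # segments are generated lazily, so consume them all
        pass
    return time.perf_counter() - start

for n in (2, 4, 8, 14):
    print(f"{n} threads: {time_cpu_run(n):.0f}s")
```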

@rsmith02ct commented Apr 21, 2023 via email

@Purfview (Contributor):

> Longer sample? It was about a 90 minute source video. Yes, for a few short samples I did large does much better with proper names, especially unusual ones.

Oh, I misread it. That's impressive performance you have there. :)

@rsmith02ct commented Apr 21, 2023 via email

@Purfview (Contributor) commented Apr 21, 2023:

> For CPU I need to disable CUDA? Can you give me a command line setting to do what you need re: cores and threads? I have a 14 core CPU (13600K) and don't mind running it at 100% (I was using Subtitle Edit with CPP and running 3 at a time to max out the CPU- about 200W for ~10 hours at a time. No overheating).

Yes, use CPU, and test --threads 14, --threads 10, --threads 8, --threads 4.
Obviously only one task at a time. How much RAM do you have?

EDIT:
"100%" CPU usage doesn't mean optimal usage; programs can have the same or better performance at, for example, 50% CPU usage.

BTW:
Actually I didn't "misread"; you wrote "1:30 min", which means 90 seconds. :)

@rsmith02ct:

Using whisper-ctranslate2.exe --language en --model "medium" --device CPU --no_speech_threshold 0.2 on a 17 min file:

With 14 threads: 258 seconds
With 10 threads: 279 seconds
With 8 threads: 290 seconds
With 4 threads: 292 seconds
No thread limit: 258 seconds
CUDA: 34 seconds

The medium model isn't bad but has trouble with some words like cacao/cocoa and Cote d'Ivoire (the country).
For some runs it seemed to get stuck, taking 3x the time with low CPU usage, even though I wasn't doing anything else on the computer. I did multiple runs and threw out the long ones.

Regarding efficiency, my goal was to load the CPU as much as possible and minimize my own time. Getting it to about 200W left nothing on the table (each instance of CPP seemed to occupy about 4 cores) and it worked away on batch transcriptions in 3 Subtitle Edits as I worked on other things with my laptop. Is it possible each one would have finished somewhat faster had I done them all in serial? Sure. The CPU also heated the room that day; I left the ductless heat pump off.

@Purfview (Contributor) commented Apr 21, 2023:

Interesting...

Please do a 2-threads test.

@rsmith02ct:

Is this necessary? I've done quite a lot of tests. It will probably yield a worse time.

@Purfview (Contributor):

Yes, it is.

@rsmith02ct:

2 threads: first run 472s; second run 477s.

@Purfview (Contributor):

Thanks, looks like I need to set the default to a max of 4 threads.
Interestingly, there is some boost above 8 threads.

@rsmith02ct:

Why limit it at all?

@Purfview (Contributor):

Because it's a waste of electricity for nothing.

@darnn commented Apr 21, 2023:

FWIW, over here, on a file that's just shy of three minutes (2:58), in Hebrew, with the large model:
WhisperDesktop: 59 seconds
The CPU build of Faster-Whisper linked above, using the command line, default settings: 267 seconds
2 threads: 326 seconds
4 threads: 266 seconds
8 threads: 262 seconds
16 threads: 281 seconds

Though I will say that Faster-Whisper was more accurate in three or four words out of that file.

@Purfview (Contributor):

@darnn WhisperDesktop is Const-me, it runs on GPU.

@darnn commented Apr 22, 2023:

Oh, yes, I didn't mean to suggest otherwise, it's just that the last time I tried Whisper-Faster, it wouldn't run the large model on the GPU at all, because I didn't have the 10 GB of memory I needed. But I just tried again with your standalone version, and it does indeed run it. I tried the default, 8 and 16 threads, and with all of them the results were 78-83 seconds. The output was still a little bit more accurate than Whisper-Desktop, but, strangely, a little less accurate than with your CPU build.

@ras0k (Author) commented Apr 24, 2023 via email

@Purfview (Contributor) commented Apr 26, 2023:

> Oh, yes, I didn't mean to suggest otherwise, it's just that the last time I tried Whisper-Faster, it wouldn't run the large model on the GPU at all, because I didn't have the 10 GB of memory I needed.

Thanks for the tests. I think that back then you tested the "OpenAI standalone".

> But I just tried again with your standalone version, and it does indeed run it. I tried the default, 8 and 16 threads, and with all of them the results were 78-83 seconds. The output was still a little bit more accurate than Whisper-Desktop, but, strangely, a little less accurate than with your CPU build.

Some difference between GPU<->CPU results is normal.
Check the new "b103" release: the default threads issue should be fixed, it now supports languages by their full names, debug output was added with --verbose True, and there are a few more parameters. [All parameters should be supported now.]
"b103" means the last commit it was compiled from. I didn't notice a change in performance or results vs b94, at least with the medium model.

> A waste of electricity ? where is the electricity wasted ? either it is computing or it is not, there is no ‘‘waste’’

Yes, in the CPU. That's not how multi-core CPUs work.

@darnn commented Apr 27, 2023:

With the latest GPU release, with the default settings, it processes the same file in 55 seconds! So slightly faster than Whisper-Desktop now. Now, the question is, can I tweak any of the settings that would make it more accurate? I've never tried messing with any of these at all, and so I don't even really know what they are (beam size? something?), but as I said before, the CPU's output was slightly more accurate with its default settings.

@Purfview (Contributor) commented Apr 27, 2023:

You can try to increase --beam_size; I set it to 1 by default.
Higher means slower transcription.

And you can try --vad_filter False. [VAD can skip some lines.]
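In library terms, those two switches appear to map onto arguments of faster-whisper's transcribe() call; a hedged sketch, assuming the library's beam_size and vad_filter parameters are what the standalone's flags wrap:

```python
from faster_whisper import WhisperModel

model = WhisperModel("large-v2", device="cuda", compute_type="float16")

# beam_size=1 is the fast default described above; a larger beam (e.g. 5)
# may improve accuracy at the cost of slower transcription.
# vad_filter=True lets a voice-activity detector drop non-speech, which can
# also skip real lines, hence trying it disabled here.
segments, _info = model.transcribe(
    "audio.wav",
    beam_size=5,
    vad_filter=False,
)
for segment in segments:
    print(segment.text)
```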

@ras0k (Author) commented Apr 28, 2023 via email

@Purfview (Contributor):

@darnn @rsmith02ct @ras0k
Got rid of PyTorch in the new "r117" release [that's why it's one small executable]. As you reported that CPU was more accurate, I matched the CUDA precision to the CPU one. Added a bit more info when running with --verbose.

Could you test whether it works OK on CUDA? Speed should be similar, maybe slower than the previous version.

@darnn commented May 13, 2023:

Sounds interesting! I won't have time to test thoroughly in the next two or three days, but I will after that!

@rsmith02ct:

Could you share a link to the release so I don't have to look for it?

@Purfview (Contributor):

@darnn commented May 14, 2023:

Well, huh! I still haven't messed with all the different settings (is there anything other than beam_size that might improve accuracy?), but with the default settings, running through CUDA, it gives me the exact same time WhisperDesktop does for the file I used, 56 seconds.

@rsmith02ct:

Retesting with the 17 min file I tested above

From the command line it gives an error but then proceeds:
2023-05-15 14:34:40.1329173 [W:onnxruntime:Default, onnxruntime_pybind_state.cc:1671 onnxruntime::python::CreateInferencePybindStateModule] Init provider bridge failed.

--language en --model "medium" --device CPU --no_speech_threshold 0.2
Time: 512s (seemed to get stuck more than once)
Second run: 297s

--language en --model "medium" --device CUDA --no_speech_threshold 0.2
Time: 34s

So basically as fast as before

Through SubtitleEdit beta "ctranslate2" engine
Time: 41s

@rsmith02ct:

Have you tested WhisperX? I can't actually figure out how to install it (too complicated), but I'm interested to see if it's got better timestamps:
https://github.com/m-bain/whisperX
