Skip to content

Commit

Permalink
add model info
Browse files Browse the repository at this point in the history
  • Loading branch information
do-me committed Nov 5, 2023
1 parent 76c4165 commit f748870
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions index.html
Original file line number Diff line number Diff line change
Expand Up @@ -367,7 +367,7 @@ <h4>Chat (Retrieval Augmented Generation, RAG)</h4>
results in the form: <br><b>"Based on the following input, answer the question: [your
question] [top search results]"</b>.<br> If you encounter errors, the input is probably
too long (either too many or too
long results or too long prompt). <br><b>Attention</b>: Loads very large models with more than 1.5Gb1 (!) of resources.</p>
long results or too long prompt). Xenova/LaMini-Flan-T5-783M is by far the best quantized model and delivers good results while the others produce nonsense in most cases. Also, make sure to check the right prompting style!<br><b>Attention</b>: Loads very large models with more than 1.5Gb1 (!) of resources.</p>
<div class="form-floating input-group mb-3">
<input id="chat_query" class="form-control"
value="What do these paragraphs have in common?">
Expand Down Expand Up @@ -441,7 +441,7 @@ <h4>Chat (Retrieval Augmented Generation, RAG)</h4>
<div>
<hr />
<h4>Summary (Retrieval Augmented Generation, RAG)</h4>
<p>Summarizes the top results in 1-2 min. Works best with non-fictional texts and longer text
<p>Summarizes the top search results. Works best with non-fictional texts and longer text
chunks (>200 chars).<br><b>Attention</b>: Loads very large models with hundreds of MB!</p>
<br>
<div class="d-flex flex-row">
Expand Down

0 comments on commit f748870

Please sign in to comment.