add model info

do-me · Nov 5, 2023 · f748870 · f748870
1 parent 76c4165
commit f748870
Showing 1 changed file with 2 additions and 2 deletions.
diff --git a/index.html b/index.html
@@ -367,7 +367,7 @@ <h4>Chat (Retrieval Augmented Generation, RAG)</h4>
                             results in the form: <br><b>"Based on the following input, answer the question: [your
                                 question] [top search results]"</b>.<br> If you encounter errors, the input is probably
                             too long (either too many or too
-                            long results or too long prompt). <br><b>Attention</b>: Loads very large models with more than 1.5Gb1 (!) of resources.</p>
+                            long results or too long prompt). Xenova/LaMini-Flan-T5-783M is by far the best quantized model and delivers good results while the others produce nonsense in most cases. Also, make sure to check the right prompting style!<br><b>Attention</b>: Loads very large models with more than 1.5Gb1 (!) of resources.</p>
                         <div class="form-floating input-group mb-3">
                             <input id="chat_query" class="form-control"
                                 value="What do these paragraphs have in common?">
@@ -441,7 +441,7 @@ <h4>Chat (Retrieval Augmented Generation, RAG)</h4>
                     <div>
                         <hr />
                         <h4>Summary (Retrieval Augmented Generation, RAG)</h4>
-                        <p>Summarizes the top results in 1-2 min. Works best with non-fictional texts and longer text
+                        <p>Summarizes the top search results. Works best with non-fictional texts and longer text
                             chunks (>200 chars).<br><b>Attention</b>: Loads very large models with hundreds of MB!</p>
                         <br>
                         <div class="d-flex flex-row">