Skip to content

Repo for the Knight Center journalism in the Americas BOC course "Generative AI for journalists: Discovering what data can do"

Notifications You must be signed in to change notification settings

jheasly/generative-ai-for-journalists

Repository files navigation

Note

The Python version to be used is specified in the .python-version file. It's a pyenv thing.

There's a weekly course material zip file in each of the Week_2, Week_3, Week_4 directories that you'll need.


Week 1

YouTube videos

What is Modelling? Part 1 | Module 1
https://www.youtube.com/watch?v=EuukvuoFMvg

What is Modelling? Part 2 | Module 1
https://www.youtube.com/watch?v=Qhr7iu7tQ_c


Week 2

YouTube videos

Discovering your Data - Part 1 | Module 2
https://www.youtube.com/watch?v=rIrwBvVQVPg
(Intro to the concept of embedding)

Discovering your Data - Part 2 | Module 2 https://www.youtube.com/watch?v=vuI9VlBdyAM
(Retrieval Augmented Generation, actual embedding using LangChain, store embeddings in Chroma)


Week 3

YouTube videos

Working with Models - Part 1 | Module 3
https://www.youtube.com/watch?v=G6QwZFHvHD8
(feeding documents into Mistral LLM; "agent," "tool")

Working with Models - Part 2 | Module 3
https://www.youtube.com/watch?v=F4Y1h2M4-04
(Mistral 7B)

Working with Models - Part 3 | Module 3
https://www.youtube.com/watch?v=4o9rQRg4cnM
(inserting a /directory full of documents into a vector database; load with LangChain's PyPDFDirectoryLoader, then embed the CSVs & PDFs with LangChain embedding classes, then install and load vectors into Chroma, then run a few queries. And now at 6:17 start Week 3 ...
Start by making an Agent, 8:15.)


Colab notebook

https://colab.research.google.com/drive/1uhm63oZziGMXAAeBQzaElcDWLF7OeWFq


Week 3 LangChain link

Modules | Agents

https://python.langchain.com/docs/modules/agents/


Week 4

YouTube videos

Making a Website - Part 1 | Module 4
https://www.youtube.com/watch?v=qnAD2P76YxE [7:33]
(intro to Hugging Face library Gradio)

Making a Website - Part 2 | Module 4
https://www.youtube.com/watch?v=cn7OmJQS1SI [4:46]
(futzing around with Hugging Face and Gradio websites)

Making a Website - Part 3 | Module 4 [8:31]
https://www.youtube.com/watch?v=u188e2LKdC8
(going through Week 4 Jupyter notebook)

Interview with John Keefe | Module 4
https://www.youtube.com/watch?v=Q7GIEh47Rls

Interview with Freddy Boulton | Module 4
https://www.youtube.com/watch?v=gavp14FR38Y


Colab notebook

https://colab.research.google.com/drive/1RQVyHHaEzYLyJqXpeI7-Ujq9r_DSHW4-


Referenced LangChain links

Modules | Retrieval

https://python.langchain.com/docs/modules/data_connection

Modules | Retrieval | Document loaders | PDF

https://python.langchain.com/docs/modules/data_connection/document_loaders/pdf

Modules | Retrieval | Document loaders | CSV

https://python.langchain.com/docs/modules/data_connection/document_loaders/csv

Components | Text embedding models | Sentence Transformers on Hugging Face

(This is slightly different than the link in video/course.)

https://python.langchain.com/docs/integrations/text_embedding/sentence_transformers

Modules | Retrieval | Vector stores

https://python.langchain.com/docs/modules/data_connection/vectorstores/


There's also a Hugging Face Discourse/Discord?

Knight Center YouTube channel

https://www.youtube.com/@KnightCenterCourses

About

Repo for the Knight Center journalism in the Americas BOC course "Generative AI for journalists: Discovering what data can do"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published