From 5d794428e1a564fb49b7096fb3e35047385afc17 Mon Sep 17 00:00:00 2001 From: Emmett McFaralne Date: Fri, 19 Jul 2024 19:08:11 -0400 Subject: [PATCH] declared known issue for pytube downloads --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 897afc1..0795fd7 100644 --- a/README.md +++ b/README.md @@ -91,7 +91,7 @@ thepipe path/to/folder --include_regex .*\.tsx | Image | `.jpg`, `.jpeg`, `.png` | ✔️ | Uses pytesseract for OCR in text-only mode | | ZIP File | `.zip` | ✔️ | Extracts and processes contained files | | Directory | any `path/to/folder` | ✔️ | Recursively processes all files in directory | -| YouTube Video | YouTube video URLs starting with `https://youtube.com` or `https://www.youtube.com`. | ✔️ | Uses pytube for video download and Whisper for transcription. For consistent extraction, you may need to modify your `pytube` installation to send a valid user agent header (see [this issue](https://github.com/pytube/pytube/issues/399)). | +| YouTube Video (known issues) | YouTube video URLs starting with `https://youtube.com` or `https://www.youtube.com`. | ✔️ | Uses pytube for video download and Whisper for transcription. For consistent extraction, you may need to modify your `pytube` installation to send a valid user agent header (see [this issue](https://github.com/pytube/pytube/issues/399)). | | Tweet | URLs starting with `https://twitter.com` or `https://x.com` | ✔️ | Uses unofficial API, may break unexpectedly | | GitHub Repository | GitHub repo URLs starting with `https://github.com` or `https://www.github.com` | ✔️ | Requires GITHUB_TOKEN environment variable |