Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

refactor(community): update YoutubeLoader implementation #7477

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

sinedied
Copy link
Contributor

@sinedied sinedied commented Jan 7, 2025

Changes

  • Removes youtube-transcript dependency
  • Updates youtubei.js dependency and made peer dependency versioning lenient
  • Refactored YoutubeLoader to use youtubei.js Innertube API instead or relying on youtube-transcript web scraping (may solve some issues regarding YT IP banning)
  • Added test for YoutubeLoader

As YT transcript loading is often broken and requires frequent package updates, this change will (hopefully) simplify the maintenance by now relying on a single package that can be updated independently of @langchain/community package.

Copy link

vercel bot commented Jan 7, 2025

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
langchainjs-docs ✅ Ready (Inspect) Visit Preview Jan 7, 2025 10:51am
1 Skipped Deployment
Name Status Preview Comments Updated (UTC)
langchainjs-api-refs ⬜️ Ignored (Inspect) Jan 7, 2025 10:51am

@dosubot dosubot bot added size:M This PR changes 30-99 lines, ignoring generated files. auto:refactor A large refactor of a feature(s) or restructuring of many files labels Jan 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
auto:refactor A large refactor of a feature(s) or restructuring of many files size:M This PR changes 30-99 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant