- YouTube extraction with transcript support - Instagram reel extraction via browser automation - Blog/article web scraping - Auto-save to Obsidian vaults - Smart key point generation - Configurable via .env file - Quick extract shell script Tech stack: Python, requests, beautifulsoup4, playwright, youtube-transcript-api
24 lines
359 B
Plaintext
24 lines
359 B
Plaintext
# Content Extractor Dependencies
|
|
|
|
# Web scraping
|
|
requests>=2.31.0
|
|
beautifulsoup4>=4.12.0
|
|
lxml>=4.9.0
|
|
|
|
# YouTube
|
|
youtube-transcript-api>=0.6.0
|
|
pytube>=15.0.0
|
|
|
|
# Browser automation (for Instagram and dynamic content)
|
|
playwright>=1.40.0
|
|
|
|
# Text processing
|
|
markdown>=3.5.0
|
|
|
|
# Utilities
|
|
python-dotenv>=1.0.0
|
|
pydantic>=2.5.0
|
|
|
|
# Date handling
|
|
python-dateutil>=2.8.0
|