feat: Initial commit - Content Extractor for YouTube, Instagram, and blogs
- YouTube extraction with transcript support
- Instagram reel extraction via browser automation
- Blog/article web scraping
- Auto-save to Obsidian vaults
- Smart key point generation
- Configurable via .env file
- Quick extract shell script

Tech stack: Python, requests, beautifulsoup4, playwright, youtube-transcript-api
21
.env.example
Normal file
@@ -0,0 +1,21 @@
# Content Extractor Configuration

# Obsidian vault path (default: ~/Obsidian Vault)
OBSIDIAN_VAULT_PATH=~/Obsidian Vault

# Browser settings (for Instagram extraction)
BROWSER_HEADLESS=true
BROWSER_TIMEOUT=30000

# Content extraction settings
MAX_CONTENT_LENGTH=10000
GENERATE_SUMMARY=true

# YouTube settings
YOUTUBE_LANGUAGE=en

# Instagram settings
INSTAGRAM_WAIT_TIME=5

# Logging
LOG_LEVEL=INFO
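
The commit shows only the example configuration, not the code that reads it. As a rough sketch of how these keys could be consumed, the snippet below parses the file by hand and converts each value to a typed field. The names `parse_env`, `as_bool`, `Settings`, and `load_settings` are hypothetical and do not come from the commit; the parser is hand-rolled because no dotenv library appears in the listed tech stack. Expanding `~` in `OBSIDIAN_VAULT_PATH` is likewise an assumption about the intended behavior.

```python
import os
from dataclasses import dataclass
from pathlib import Path


def parse_env(text):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Unquoted values may contain spaces (e.g. "~/Obsidian Vault").
        values[key.strip()] = value.strip().strip("\"'")
    return values


def as_bool(value):
    """Treat common truthy spellings as True; everything else is False."""
    return value.strip().lower() in {"1", "true", "yes", "on"}


@dataclass(frozen=True)
class Settings:  # hypothetical container, not part of the commit
    vault_path: Path
    browser_headless: bool
    browser_timeout: int
    max_content_length: int
    generate_summary: bool
    youtube_language: str
    instagram_wait_time: int
    log_level: str


def load_settings(env):
    """Build Settings from a dict, falling back to the .env.example defaults."""
    get = env.get
    return Settings(
        # Assumption: "~" should resolve to the user's home directory.
        vault_path=Path(os.path.expanduser(get("OBSIDIAN_VAULT_PATH", "~/Obsidian Vault"))),
        browser_headless=as_bool(get("BROWSER_HEADLESS", "true")),
        browser_timeout=int(get("BROWSER_TIMEOUT", "30000")),
        max_content_length=int(get("MAX_CONTENT_LENGTH", "10000")),
        generate_summary=as_bool(get("GENERATE_SUMMARY", "true")),
        youtube_language=get("YOUTUBE_LANGUAGE", "en"),
        instagram_wait_time=int(get("INSTAGRAM_WAIT_TIME", "5")),
        log_level=get("LOG_LEVEL", "INFO").upper(),
    )
```

A caller might use it as `settings = load_settings(parse_env(Path(".env").read_text()))`. The file does not state units for `BROWSER_TIMEOUT` or `INSTAGRAM_WAIT_TIME`, so the sketch keeps them as plain integers rather than guessing.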