Commit Graph

12 Commits

Author SHA1 Message Date
Jan Bader
a9bb2460c6 convert to backblaze fetcher 2026-04-05 22:01:50 +02:00
Jan Bader
66e1c9e0e0 Improve extraction 2026-04-04 21:16:28 +02:00
Jan Bader
40104dc0f9 Add logging 2026-04-04 21:10:33 +02:00
Jan Bader
684f7c87e6 Harder ignoring of ui prompts 2026-04-04 21:00:56 +02:00
Jan Bader
d32c696f6e Also use AI for the content 2026-04-04 20:56:44 +02:00
Jan Bader
1c719f4381 load .env in dev shell 2026-04-04 20:51:10 +02:00
Jan Bader
0163767dd1 Add AI summarization 2026-04-04 20:50:59 +02:00
Jan Bader
db44427c1f Ignore some ui prompts 2026-04-04 20:46:48 +02:00
Jan Bader
99ba4f6ac8 Ignore language list 2026-04-04 20:41:00 +02:00
Jan Bader
75a4ab20fd add python deps & playwright 2026-04-04 20:38:40 +02:00
Jan Bader
d343a48af1 Add flake 2026-04-04 20:31:37 +02:00
naki
c997e764b5 feat: Initial commit - Content Extractor for YouTube, Instagram, and blogs 2026-03-05 13:02:58 +05:30
- YouTube extraction with transcript support
- Instagram reel extraction via browser automation
- Blog/article web scraping
- Auto-save to Obsidian vaults
- Smart key point generation
- Configurable via .env file
- Quick extract shell script

Tech stack: Python, requests, beautifulsoup4, playwright, youtube-transcript-api