22. Web Scraping
22. Web Scraping
Nota atomica extraida del capitulo "22. Web Scraping" del master osint-references-master.
Contenido (de OSINT Bible)
| Tool | Function | URL | Level |
|---|---|---|---|
| Photon | Ultra-fast crawler | https://github.com/s0md3v/Photon | Intermediate |
| Scrapy | Complete framework | https://scrapy.org | Advanced |
| Playwright | Browser automation | https://playwright.dev | Advanced |
| Selenium | Classic automation | https://www.selenium.dev | Intermediate |
| Beautiful Soup | HTML/XML parser | https://www.crummy.com/software/BeautifulSoup | Basic |
Basic Photon script:
python photon.py -u https://target.com \
--export=json \
--dns \
--keys \
--threads 10