-
firecrawl Public
Forked from mendableai/firecrawl🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl, search and extract with a single API.
TypeScript GNU Affero General Public License v3.0 UpdatedMay 23, 2024 -
trafilatura Public
Forked from adbar/trafilaturaPython & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Python GNU General Public License v3.0 UpdatedFeb 22, 2024 -
self-llm Public
Forked from datawhalechina/self-llm《开源大模型食用指南》基于AutoDL快速部署开源大模型,更适合中国宝宝的部署教程
Jupyter Notebook Apache License 2.0 UpdatedJan 10, 2024 -
BaiduSpider Public
Forked from BaiduSpider/BaiduSpiderBaiduSpider,一个爬取百度搜索结果的爬虫,目前支持百度网页搜索,百度图片搜索,百度知道搜索,百度视频搜索,百度资讯搜索,百度文库搜索,百度经验搜索和百度百科搜索。
Python GNU General Public License v3.0 UpdatedJan 10, 2024 -
ja3 Public
Forked from salesforce/ja3JA3 is a standard for creating SSL client fingerprints in an easy to produce and shareable way.
Python BSD 3-Clause "New" or "Revised" License UpdatedOct 20, 2023 -
-
stf Public
Forked from DeviceFarmer/stfControl and manage Android devices from your browser.
JavaScript Other UpdatedJul 5, 2023 -
MiniGPT-4 Public
Forked from Vision-CAIR/MiniGPT-4MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
Python BSD 3-Clause "New" or "Revised" License UpdatedJun 7, 2023 -
CS-Base Public
Forked from xiaolincoder/CS-Base图解计算机网络、操作系统、计算机组成、数据库,共 1000 张图 + 50 万字,破除晦涩难懂的计算机基础知识,让天下没有难懂的八股文!🚀 在线阅读:https://xiaolincoding.com
UpdatedMay 28, 2023 -
readability Public
Forked from mozilla/readabilityA standalone version of the readability lib
JavaScript Other UpdatedMay 19, 2023 -
-
python-readability Public
Forked from buriy/python-readabilityfast python port of arc90's readability tool, updated to match latest readability.js!
Python UpdatedApr 24, 2023 -
X-Bogus Public
Forked from luoyanhan/X-BogusTikTok X-Bogus Signature Generator.
-
webspot Public
Forked from crawlab-team/webspotAn intelligent web service to automatically detect web content and extract information from it.
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedMar 23, 2023 -
CommNewsExtractor Public
Forked from kingking888/CommNewsExtractor基于文本密度 通用文章提取,正文,标题,时间,作者,图片,音视频,联系方式等
Python MIT License UpdatedMar 19, 2023 -
Text_select_captcha Public
Forked from ArchClass/Text_select_captchapytorch实现文字点选、选字、选择文字验证码识别
Python UpdatedMar 2, 2023 -
newsworker Public
Forked from ivbeg/newsworkerAdvanced news feeds extractor and finder library. Helps to automatically extract news from websites without RSS/ATOM feeds
Python MIT License UpdatedOct 2, 2022 -
GeneralNewsExtractor Public
Forked from GeneralNewsExtractor/GeneralNewsExtractor新闻网页正文通用抽取器 Beta 版.
Python GNU General Public License v3.0 UpdatedSep 25, 2022 -
GerapyAutoExtractor Public
Forked from Gerapy/GerapyAutoExtractorAuto Extractor Module
Python Apache License 2.0 UpdatedJun 30, 2022 -
Political-News-Filter Public
Forked from lukasgebhard/Political-News-Filter新闻政治分类器
Python Apache License 2.0 UpdatedJun 25, 2022 -
requests Public
Forked from wangluozhe/requestsUsed to quickly request HTTP or HTTPS
Go UpdatedMar 17, 2022 -
-
python-goose Public
Forked from grangier/python-gooseHtml Content / Article Extractor, web scrapping lib in Python
HTML Apache License 2.0 UpdatedDec 26, 2021 -
ast-hook-for-js-RE Public
Forked from JSREI/ast-hook-for-js-RE浏览器内存漫游解决方案(探索中...)
JavaScript Other UpdatedSep 23, 2021 -
-
WeiboSpider Public
Forked from CharesFang/WeiboSpider新浪微博爬虫,一个基于Scrapy框架的迷你微博爬虫,Sina Weibo Spider
Python GNU General Public License v3.0 UpdatedJul 25, 2021 -
crawler-js-hook-framework-public Public
Forked from JSREI/crawler-js-hook-framework-publicJavaScript UpdatedJun 20, 2021 -
cnn_captcha Public
Forked from nickliqian/cnn_captchause cnn recognize captcha by tensorflow. 本项目针对字符型图片验证码,使用tensorflow实现卷积神经网络,进行验证码识别。
Python Apache License 2.0 UpdatedJun 8, 2021 -
Review_Reverse Public
Forked from tcc0lin/Review_Reverse👋2019年末总结下今年做过的逆向,整理代码,复习思路。🙏拼夕夕Web端anti_content参数逆向分析👺 WEB淘宝sign逆向分析;😺努比亚Cookie生成逆向分析;🙌百度指数data加密逆向分析 👣今日头条WEB端_signature、as、cp参数逆向分析🎶知乎登录formdata加密逆向分析 🤡KNN猫眼字体反爬👅Boss直聘Cookie加密字段__zp_stoken__逆向分析
JavaScript UpdatedJun 8, 2021 -
hooker Public
Forked from CreditTone/hooker🔥🔥hooker是一个基于frida实现的逆向工具包。为逆向开发人员提供统一化的脚本包管理方式、通杀脚本、自动化生成hook脚本、内存漫游探测activity和service和其他任意对象。
JavaScript UpdatedApr 27, 2021