Skip to content
View oldsiks's full-sized avatar

Block or report oldsiks

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • firecrawl Public

    Forked from mendableai/firecrawl

    🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl, search and extract with a single API.

    TypeScript GNU Affero General Public License v3.0 Updated May 23, 2024
  • trafilatura Public

    Forked from adbar/trafilatura

    Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

    Python GNU General Public License v3.0 Updated Feb 22, 2024
  • 《开源大模型食用指南》基于AutoDL快速部署开源大模型,更适合中国宝宝的部署教程

    Jupyter Notebook Apache License 2.0 Updated Jan 10, 2024
  • BaiduSpider,一个爬取百度搜索结果的爬虫,目前支持百度网页搜索,百度图片搜索,百度知道搜索,百度视频搜索,百度资讯搜索,百度文库搜索,百度经验搜索和百度百科搜索。

    Python GNU General Public License v3.0 Updated Jan 10, 2024
  • ja3 Public

    Forked from salesforce/ja3

    JA3 is a standard for creating SSL client fingerprints in an easy to produce and shareable way.

    Python BSD 3-Clause "New" or "Revised" License Updated Oct 20, 2023
  • tweepy Public

    Forked from tweepy/tweepy

    Twitter for Python!

    Python MIT License Updated Aug 26, 2023
  • stf Public

    Forked from DeviceFarmer/stf

    Control and manage Android devices from your browser.

    JavaScript Other Updated Jul 5, 2023
  • MiniGPT-4 Public

    Forked from Vision-CAIR/MiniGPT-4

    MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

    Python BSD 3-Clause "New" or "Revised" License Updated Jun 7, 2023
  • CS-Base Public

    Forked from xiaolincoder/CS-Base

    图解计算机网络、操作系统、计算机组成、数据库,共 1000 张图 + 50 万字,破除晦涩难懂的计算机基础知识,让天下没有难懂的八股文!🚀 在线阅读:https://xiaolincoding.com

    Updated May 28, 2023
  • readability Public

    Forked from mozilla/readability

    A standalone version of the readability lib

    JavaScript Other Updated May 19, 2023
  • mlscraper Public

    Forked from lorey/mlscraper

    自动识别 列表页

    Python Updated Apr 29, 2023
  • fast python port of arc90's readability tool, updated to match latest readability.js!

    Python Updated Apr 24, 2023
  • X-Bogus Public

    Forked from luoyanhan/X-Bogus

    TikTok X-Bogus Signature Generator.

    JavaScript 1 Updated Apr 20, 2023
  • webspot Public

    Forked from crawlab-team/webspot

    An intelligent web service to automatically detect web content and extract information from it.

    Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Mar 23, 2023
  • 基于文本密度 通用文章提取,正文,标题,时间,作者,图片,音视频,联系方式等

    Python MIT License Updated Mar 19, 2023
  • pytorch实现文字点选、选字、选择文字验证码识别

    Python Updated Mar 2, 2023
  • newsworker Public

    Forked from ivbeg/newsworker

    Advanced news feeds extractor and finder library. Helps to automatically extract news from websites without RSS/ATOM feeds

    Python MIT License Updated Oct 2, 2022
  • 新闻网页正文通用抽取器 Beta 版.

    Python GNU General Public License v3.0 Updated Sep 25, 2022
  • Auto Extractor Module

    Python Apache License 2.0 Updated Jun 30, 2022
  • 新闻政治分类器

    Python Apache License 2.0 Updated Jun 25, 2022
  • requests Public

    Forked from wangluozhe/requests

    Used to quickly request HTTP or HTTPS

    Go Updated Mar 17, 2022
  • ast_tools Public

    Forked from sml2h3/ast_tools

    ast基础框架-基于babel

    JavaScript Updated Mar 11, 2022
  • Html Content / Article Extractor, web scrapping lib in Python

    HTML Apache License 2.0 Updated Dec 26, 2021
  • 浏览器内存漫游解决方案(探索中...)

    JavaScript Other Updated Sep 23, 2021
  • 影片数据分析

    Updated Sep 3, 2021
  • 新浪微博爬虫,一个基于Scrapy框架的迷你微博爬虫,Sina Weibo Spider

    Python GNU General Public License v3.0 Updated Jul 25, 2021
  • JavaScript Updated Jun 20, 2021
  • use cnn recognize captcha by tensorflow. 本项目针对字符型图片验证码,使用tensorflow实现卷积神经网络,进行验证码识别。

    Python Apache License 2.0 Updated Jun 8, 2021
  • 👋2019年末总结下今年做过的逆向,整理代码,复习思路。🙏拼夕夕Web端anti_content参数逆向分析👺 WEB淘宝sign逆向分析;😺努比亚Cookie生成逆向分析;🙌百度指数data加密逆向分析 👣今日头条WEB端_signature、as、cp参数逆向分析🎶知乎登录formdata加密逆向分析 🤡KNN猫眼字体反爬👅Boss直聘Cookie加密字段__zp_stoken__逆向分析

    JavaScript Updated Jun 8, 2021
  • hooker Public

    Forked from CreditTone/hooker

    🔥🔥hooker是一个基于frida实现的逆向工具包。为逆向开发人员提供统一化的脚本包管理方式、通杀脚本、自动化生成hook脚本、内存漫游探测activity和service和其他任意对象。

    JavaScript Updated Apr 27, 2021