Skip to content
View LinXin04's full-sized avatar

Block or report LinXin04

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper

Python 28,151 2,227 Updated Jan 31, 2025
Python 369 31 Updated Nov 22, 2024

A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex.

Python 29 1 Updated May 16, 2024

This project is designed to extract text from documents and prepare it for processing by Large Language Models (LLM). Implemented a feature to store and utilize text style information, enabling the…

HTML 8 1 Updated Nov 17, 2024

Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular data extraction and multimodal queries.

Python 66 1 Updated Jan 31, 2025

An LLM-based Web Navigating Agent (KDD'24)

Python 801 67 Updated Sep 27, 2024

大模型文本分类

Python 31 3 Updated Aug 15, 2024

使用 Qwen2ForSequenceClassification 简单实现文本分类任务。

Python 47 2 Updated Jun 12, 2024

总结Prompt&LLM论文,开源数据&模型,AIGC应用

2,817 285 Updated Jan 26, 2025

A Regular Expression constraint for Language Models of transformers. With this module, you can force the LLMs to generate following your regex. Using regex in tokens and tensors are also implemente…

Python 7 Updated May 14, 2024

Retrieval and Retrieval-augmented LLMs

Python 8,338 609 Updated Jan 23, 2025
Python 77 7 Updated Aug 11, 2023

Deep Learning for Time Series Classification

Python 1,584 572 Updated Mar 18, 2023

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

Python 601 88 Updated Oct 30, 2024

This Python code was developed to create RNNs for analyzing time-intensive multimodal process data and non-time-series data.

Python 1 Updated Aug 2, 2022

A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch

Python 1,317 193 Updated Nov 6, 2024

GMoE could be the next backbone model for many kinds of generalization task.

Python 265 35 Updated Mar 21, 2023

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 27,152 3,419 Updated Jul 23, 2024

The state-of-the-art image restoration model without nonlinear activation functions.

Python 2,340 298 Updated Jul 3, 2024

PyTorch implementation of Sentiment Analysis of the long texts written in Serbian language (which is underused language) using pretrained Multilingual RoBERTa based model (XLM-R) on the small dataset.

Jupyter Notebook 26 7 Updated Nov 20, 2022
Python 4 Updated Apr 7, 2022

text classification using mbert

Jupyter Notebook 19 4 Updated Apr 19, 2021

Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

Python 4,145 725 Updated May 29, 2024

Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for employing Transformer models in text classification tasks. Contains code to easily train BERT, XLNet, Ro…

Jupyter Notebook 306 97 Updated May 9, 2020

ppyolo in pytorch. 44.8% box mAP.

Python 106 27 Updated Dec 19, 2021

中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。

4,054 1,003 Updated Mar 27, 2024

This repository is MLP implementation of classifier on MNIST dataset with PyTorch

Jupyter Notebook 38 26 Updated Dec 1, 2018

An approach with neural networks to the Titanic Kaggle problem, using MLPClassifier from sklearn.

Python 3 Updated Sep 29, 2018

Rank gaussian normalization, Swap noise, Denoised AutoEncoder as feature engineering

Jupyter Notebook 10 3 Updated Nov 17, 2020
Next