LinXin04

jane LinXin04

1 follower · 2 following

Stars

unclecode / crawl4ai

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper

Python 28,151 2,227 Updated Jan 31, 2025

HawkClaws / main_content_extractor

A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex.

Python 29 1 Updated May 16, 2024

ChenTaHung / HTML-Text-Parser

This project is designed to extract text from documents and prepare it for processing by Large Language Models (LLM). Implemented a feature to store and utilize text style information, enabling the…

HTML 8 1 Updated Nov 17, 2024

parsee-ai / parsee-core

Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular data extraction and multimodal queries.

Python 66 1 Updated Jan 31, 2025

THUDM / AutoWebGLM

An LLM-based Web Navigating Agent (KDD'24)

Python 801 67 Updated Sep 27, 2024

sunyongdi / llm_classification

Text classification with large language models

Python 31 3 Updated Aug 15, 2024

muyaostudio / qwen2_seq_cls

A simple implementation of text classification using Qwen2ForSequenceClassification.

Python 47 2 Updated Jun 12, 2024

DSXiangLi / DecryptPrompt

A summary of Prompt & LLM papers, open-source data & models, and AIGC applications

2,817 285 Updated Jan 26, 2025

kuangkzh / transformers-re

A Regular Expression constraint for Language Models of transformers. With this module, you can force the LLMs to generate following your regex. Using regex in tokens and tensors are also implemente…

Python 7 Updated May 14, 2024

FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Python 8,338 609 Updated Jan 23, 2025

hfawaz / dl-4-tsc

Deep Learning for Time Series Classification

Python 1,584 572 Updated Mar 18, 2023

georgian-io / Multimodal-Toolkit

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

Python 601 88 Updated Oct 30, 2024

naveirmd / Multimodal_RNN_Python_Code

This Python code was developed to create RNNs for analyzing time-intensive multimodal process data and non-time-series data.

Python 1 Updated Aug 2, 2022

jrzaurin / pytorch-widedeep

A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch

Python 1,317 193 Updated Nov 6, 2024

Luodian / Generalizable-Mixture-of-Experts

GMoE could be the next backbone model for many kinds of generalization tasks.

Python 265 35 Updated Mar 21, 2023

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 27,152 3,419 Updated Jul 23, 2024

megvii-research / NAFNet

The state-of-the-art image restoration model without nonlinear activation functions.

Python 2,340 298 Updated Jul 3, 2024

Data-Science-kosta / Long-texts-Sentiment-Analysis-RoBERTa

PyTorch implementation of sentiment analysis of long texts written in Serbian (an underused language), using a pretrained multilingual RoBERTa-based model (XLM-R) on a small dataset.

Jupyter Notebook 26 7 Updated Nov 20, 2022

UrosOgrizovic / RobertaPretraining

Python 4 Updated Apr 7, 2022

emarkou / multilingual-bert-text-classification

text classification using mbert

Jupyter Notebook 19 4 Updated Apr 19, 2021

ThilinaRajapakse / simpletransformers

Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

Python 4,145 725 Updated May 29, 2024

ThilinaRajapakse / pytorch-transformers-classification

Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for employing Transformer models in text classification tasks. Contains code to easily train BERT, XLNet, Ro…

Jupyter Notebook 306 97 Updated May 9, 2020

miemie2013 / Pytorch-PPYOLO

ppyolo in pytorch. 44.8% box mAP.

Python 106 27 Updated Dec 19, 2021

wainshine / Chinese-Names-Corpus

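Chinese names corpus. Name generator. Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, English names. Useful for Chinese word segmentation and person-name entity recognition.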

4,054 1,003 Updated Mar 27, 2024

iam-mhaseeb / Multi-Layer-Perceptron-MNIST-with-PyTorch

This repository is an MLP implementation of a classifier on the MNIST dataset with PyTorch

Jupyter Notebook 38 26 Updated Dec 1, 2018

ZisisFl / MLPClassifier-Neural-Network-Titanic-Kaggle

An approach with neural networks to the Titanic Kaggle problem, using MLPClassifier from sklearn.

Python 3 Updated Sep 29, 2018

BenjiKCF / Tabular-data-Winning-Solution

Rank gaussian normalization, Swap noise, Denoised AutoEncoder as feature engineering

Jupyter Notebook 10 3 Updated Nov 17, 2020
