Stars
Built a multilingual text classification model to predict the probability that a comment is toxic using the data provided by Google Jigsaw.
Multilingual hate speech detection using pre-trained BERT and XLM-RoBERTa models with prompt tuning.
Multilingual Named Entity Recognition by XLM-Roberta model with CRF
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training (ACL 2023))
Language-Agnostic SEntence Representations
PyTorch original implementation of Cross-lingual Language Model Pretraining.
Named Entity Recognition with Pretrained XLM-RoBERTa
Dataset Availabilty for Application of Word embedding and Deep learning in detecting phishing emails
End-to-end implementation of Spam Detection in Email using Machine Learning, Python, Flask, Gunicorn, Scikit-Learn, and Logistic Regression on the Heroku cloud application platform.
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and…
merico-dev / lake
Forked from apache/incubator-devlakeDevLake: the open-source dev data platform & dashboard for your DevOps tools. *Note*: We have moved to Apache Software Foundation https://github.com/apache/incubator-devlake.
The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many mo…
This repository contains the implementation of spam email detection using Dataset.csv, Dataset has spam and ham labels and associated text. I am using Deep Learning to detect Spam /Ham emails.
the Email spam detection predicts whether email is spam or not. dataset from kaggle is used to classify email using text pre-processing techniques, and Naive Bayes algorithm.
Classified messages as Spam or Ham using NLTK and Scikit-learn
A topic-centric list of HQ open datasets.
clash节点、免费clash节点、免费节点、免费梯子、clash科学上网、clash翻墙、clash订阅链接、clash for Windows、clash教程、免费公益节点、最新clash免费节点订阅地址、clash免费节点每日更新
Text-to-SQL task, with BERT for sentence embedding and GAT for syntax info extraction.
Implementation for AAAI workshop 2022
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.