dsorato

Follow

Danielly Sorato dsorato

Follow

Computational linguist/sociolinguist; PhD candidate at UPF.

9 followers · 14 following

Universidad Pompeu Fabra
Barcelona

Stars

escline / InstallCert

Java program to retrieve server certificate that can be added to local keystore

Java 885 857 Updated Jun 10, 2022

linuxserver / docker-swag

Nginx webserver and reverse proxy with php support and a built-in Certbot (Let's Encrypt) client. It also contains fail2ban for intrusion prevention.

Dockerfile 2,945 247 Updated Dec 14, 2024

Unbabel / COMET

A Neural Framework for MT Evaluation

Python 515 82 Updated Dec 5, 2024

zwhe99 / MAPS-mt

[TACL 2024] MAPS enables LLMs🤖 to mimic the human😁 translation process.

Python 136 5 Updated Jun 7, 2024

mjpost / sacrebleu

Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons

Python 1,085 165 Updated Aug 17, 2024

ajdavidl / Portuguese-NLP

List of resources and tools developed with focus on Portuguese.

251 28 Updated Nov 5, 2024

22-hours / cabrita

Finetuning InstructLLaMA with portuguese data

Jupyter Notebook 557 68 Updated Jun 6, 2023

chozelinek / europarl

Toolkit to compile a comparable/parallel corpus from European Parliament proceedings

Python 15 4 Updated Jan 26, 2020

kanekomasahiro / evaluate_bias_in_mlm

Python 13 3 Updated Dec 1, 2021

katyfelkner / winoqueer

Python 13 Updated Sep 5, 2024

LivNLP / bias-sense

Python 8 Updated Mar 12, 2022

vecto-ai / word-benchmarks

Benchmarks for intrinsic word embeddings evaluation.

60 25 Updated Jun 25, 2018

MLforHealth / HurtfulWords

Quantifying biases in BERT embeddings pretrained on MIMIC-III clinical notes

Python 21 9 Updated Mar 26, 2021

dmlc / gluon-nlp

NLP made easy

Python 2,560 534 Updated Oct 6, 2023

awslabs / mlm-scoring

Python library & examples for Masked Language Model Scoring (ACL 2020)

Python 337 59 Updated Dec 20, 2022

huggingface / evaluate

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

Python 2,063 263 Updated Sep 17, 2024

huggingface / ethics-scripts

Python 14 10 Updated Apr 20, 2023

briceno-rosas / ESS-Interim-Dataset

R 1 1 Updated Jan 26, 2024

IQSS / Amelia

Amelia: A Package for Missing Data

R 63 15 Updated Nov 7, 2024

lizaku / vec2graph

Mini-library for producing graph visualizations from embedding models

Python 28 2 Updated Sep 10, 2020

roccotrip / antisem

Tracing Antisemitic Language Through Diachronic Embedding Projections: France 1789-1914

Python 2 Updated Aug 1, 2019

liferay / liferay-ide

Java 131 168 Updated Dec 4, 2024

huggingface / datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python 19,359 2,710 Updated Dec 13, 2024

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,381 27,314 Updated Dec 17, 2024

Noixas / Official-Evaluating-Bias-In-Dutch

Relevant data for the paper Evaluating Bias in Dutch Word Embeddings.

4 Updated Jan 4, 2021

dccuchile / wefe

WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in…

Python 175 14 Updated Jun 18, 2024

deep-spin / uncertainties_MT_eval

Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.

Python 23 3 Updated Jun 23, 2023

fail2ban / fail2ban

Daemon to ban hosts that cause multiple authentication errors

Python 12,607 1,260 Updated Dec 16, 2024

liferay / liferay-blade-samples

Java 156 468 Updated Oct 27, 2023

babylonhealth / fastText_multilingual

Multilingual word vectors in 78 languages

Jupyter Notebook 1,197 121 Updated Mar 10, 2023