-
Universidad Pompeu Fabra
- Barcelona
Stars
Java program to retrieve server certificate that can be added to local keystore
Nginx webserver and reverse proxy with php support and a built-in Certbot (Let's Encrypt) client. It also contains fail2ban for intrusion prevention.
[TACL 2024] MAPS enables LLMs🤖 to mimic the human😁 translation process.
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
List of resources and tools developed with focus on Portuguese.
Finetuning InstructLLaMA with portuguese data
Toolkit to compile a comparable/parallel corpus from European Parliament proceedings
Benchmarks for intrinsic word embeddings evaluation.
Quantifying biases in BERT embeddings pretrained on MIMIC-III clinical notes
Python library & examples for Masked Language Model Scoring (ACL 2020)
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
Mini-library for producing graph visualizations from embedding models
Tracing Antisemitic Language Through Diachronic Embedding Projections: France 1789-1914
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Relevant data for the paper Evaluating Bias in Dutch Word Embeddings.
WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in…
Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.
Daemon to ban hosts that cause multiple authentication errors
Multilingual word vectors in 78 languages