-
H2O.ai
- United States
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
A Framework of Small-scale Large Multimodal Models
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer
Official code for Paper "Mantis: Multi-Image Instruction Tuning" (TMLR2024)
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A high-throughput and memory-efficient inference and serving engine for LLMs
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
Realtime Web Apps and Dashboards for Python and R
A guidance language for controlling large language models.
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
Repository for the Lux AI Challenge, season 2 (NeurIPS 23). Hosted on @kaggle
200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science.
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Rotation and skew detection using DL.
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Team original solutions for the Human Protein Atlas image classification competition
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
h2oai / unilm
Forked from microsoft/unilmUniLM - Unified Language Model Pre-training / Pre-training for NLP and Beyond
A PyTorch impl of EfficientDet faithful to the original Google impl w/ ported weights
This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
10th place solution of the Jigsaw Unintended Bias in Toxicity Classification
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
Convert scans of handwritten notes to beautiful, compact PDFs