Add support for Gemma 3 (text) #1229

xenova · 2025-03-13T02:26:14Z

Currently only the 1B model has been converted, but I'll make conversions for the rest soon!

Example usage:

import { pipeline } from "@huggingface/transformers";

// Create a text generation pipeline
const generator = await pipeline(
  "text-generation",
  "onnx-community/gemma-3-1b-it-ONNX",
  { dtype: "q4" },
);

// Define the list of messages
const messages = [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Write me a poem about Machine Learning." },
];

// Generate a response
const output = await generator(messages, { max_new_tokens: 512, do_sample: false });
console.log(output[0].generated_text.at(-1).content);

See example output

Okay, here's a poem about Machine Learning, aiming to capture its essence and potential:

**The Learning Algorithm**

A silent hum, a coded grace,
A network grows, a steady pace.
Machine Learning, swift and bright,
Unraveling data, day and night.

It learns from patterns, subtle, deep,
Where errors hide, secrets sleep.
With data flowing, vast and wide,
It builds a model, side by side.

Predicting trends, forecasting near,
Recognizing faces, calming fear.
Classifying images, text so true,
Discovering insights, shining new.

From spam filters, swift and keen,
To recommending what you’ve seen,
It learns your habits, day by day,
Improving swiftly, come what may.

But caution whispers, a gentle plea,
“Control the bias, let it be free.”
For ethics guide, a crucial art,
To use this power, play a vital part.

So let the learning algorithm flow,
Expanding knowledge, watch it grow.
A powerful tool, a future bright,
Machine Learning, shining light.

---

Would you like me to tweak this poem in any way? For example, would you like me to:

*   Focus on a specific application (e.g., image recognition)?
*   Adjust the tone (e.g., more optimistic, more cautionary)?

HuggingFaceDocBuilderDev · 2025-03-13T02:28:10Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Glavin001 · 2025-03-18T18:00:21Z

Anything we can do to help this PR get merged? 🙂

maciejwolski · 2025-03-24T09:51:55Z

I am waiting for this too

aribornstein · 2025-03-24T10:43:16Z

Me as well

xenova · 2025-03-24T21:13:29Z

The model works well in Node.js, but for some browsers, we were having some issues when running in-browser due to the large embedding layer. So, we're working on some optimizations so that it runs well on WebGPU.

If anyone would like to build and test this PR locally, that would help a ton!

dariuszbasiak · 2025-03-25T14:46:24Z

I'm trying to run in in Chrome in MacOS but getting error: ERROR 3304823240
This number seems to be Conten-Length of buffer that fails to be created

xenova · 2025-03-25T18:39:56Z

cc @guschmue. Maybe model builder will help fix that? Any updates on that?

xenova · 2025-03-25T21:36:50Z

Since the model works correct in Node.js (and the only remaining issue is browser support due to the large embedding size), I'll merge this PR and update the weights eventually once microsoft/onnxruntime-genai#1329 is ready.

Usage should not change once a newer export is created.

dariuszbasiak · 2025-03-26T08:24:16Z

Maybe for someone it will be helpful. I run q8 gemma3 in the browser. It is slow but worked ;)

xenova · 2025-03-26T15:13:06Z

Maybe for someone it will be helpful. I run q8 gemma3 in the browser. It is slow but worked ;)

Great to hear! Is that on WebGPU or WASM? Does q4/q4f16 work for you?

dariuszbasiak · 2025-03-26T16:41:42Z

I run q8 few time with example code, but after 3 time it start generating random text. I try other dtypes and only this one work. But it was really slow it took ~4 mins to initialize it and generate something.

guschmue · 2025-03-26T20:12:22Z

cc @guschmue. Maybe model builder will help fix that? Any updates on that?
Model builder folks are working on it, initially gemma-3-1b-it.
With their latest changes it mostly works for me, there is one more issue that needs to be fixed imo.

xenova added 2 commits March 13, 2025 02:24

Add support for gemma 3 (text)

6e54cd9

Fix tensor slicing past boundaries

2f0855a

TimPietrusky mentioned this pull request Mar 16, 2025

Gemma 3 still unsupported? #1239

Closed

5 tasks

xenova changed the title ~~Add support for Gemma 3~~ Add support for Gemma 3 (text) Mar 25, 2025

xenova merged commit 06a84b5 into main Mar 25, 2025
3 of 4 checks passed

xenova deleted the new-model branch March 25, 2025 21:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for Gemma 3 (text) #1229

Add support for Gemma 3 (text) #1229

xenova commented Mar 13, 2025 •

edited

Loading

HuggingFaceDocBuilderDev commented Mar 13, 2025

Glavin001 commented Mar 18, 2025

maciejwolski commented Mar 24, 2025

aribornstein commented Mar 24, 2025

xenova commented Mar 24, 2025

dariuszbasiak commented Mar 25, 2025

xenova commented Mar 25, 2025

xenova commented Mar 25, 2025

dariuszbasiak commented Mar 26, 2025

xenova commented Mar 26, 2025

dariuszbasiak commented Mar 26, 2025

guschmue commented Mar 26, 2025

Add support for Gemma 3 (text) #1229

Add support for Gemma 3 (text) #1229

Conversation

xenova commented Mar 13, 2025 • edited Loading

HuggingFaceDocBuilderDev commented Mar 13, 2025

Glavin001 commented Mar 18, 2025

maciejwolski commented Mar 24, 2025

aribornstein commented Mar 24, 2025

xenova commented Mar 24, 2025

dariuszbasiak commented Mar 25, 2025

xenova commented Mar 25, 2025

xenova commented Mar 25, 2025

dariuszbasiak commented Mar 26, 2025

xenova commented Mar 26, 2025

dariuszbasiak commented Mar 26, 2025

guschmue commented Mar 26, 2025

xenova commented Mar 13, 2025 •

edited

Loading