Pulse · tenstorrent/tt-metal

February 5, 2025 – February 12, 2025

186 Active pull requests

152 Active issues

v0.56.0-rc10
published Feb 5, 2025
v0.56.0-rc16
published Feb 8, 2025
v0.56.0-rc21
published Feb 12, 2025
v0.56.0-rc24
published Feb 12, 2025

Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.

[tt-train] Add RMSNorm module
#16991 commented on Feb 11, 2025 • 12 new comments
New `split` based on `slice`
#17461 commented on Feb 9, 2025 • 8 new comments
#15450: Remove default values from circular buffer parameters in LLK compute APIs: Matmul
#16571 commented on Feb 7, 2025 • 4 new comments
Add avg_pool2d with kernel size support
#14268 commented on Feb 6, 2025 • 3 new comments
Allow the user to select the version of the docs
#17434 commented on Feb 11, 2025 • 2 new comments
#16174: Support for int32 subtraction for WHB0 and BH
#17359 commented on Feb 5, 2025 • 1 new comment
#15450: Remove default values from circular buffer parameters in LLK compute APIs: Docs
#17567 commented on Feb 7, 2025 • 0 new comments
Create knowledge sharing doc to explain Python packaging and wheel setup
#12707 commented on Feb 12, 2025 • 0 new comments
Build on Ubuntu 22.04
#14390 commented on Feb 12, 2025 • 0 new comments
Use a stable serialization format for caching tensors on disk
#16067 commented on Feb 11, 2025 • 0 new comments
Clean Device init APIs
#17209 commented on Feb 11, 2025 • 0 new comments
[Feature Request] Prebuilt `*.deb` packages and PPA for TT software
#7915 commented on Feb 11, 2025 • 0 new comments
Remove obsolete dependencies and try moving remaining ones into CPM
#9407 commented on Feb 11, 2025 • 0 new comments
Breakout/Optimize Perf Microbenchmark Tests
#16774 commented on Feb 11, 2025 • 0 new comments
Anaconda Support
#13734 commented on Feb 11, 2025 • 0 new comments
implement rand for WH/BH
#14597 commented on Feb 11, 2025 • 0 new comments
investigate unsigned comparisons
#14598 commented on Feb 11, 2025 • 0 new comments
ttnn.fmod unary low PCC when scalar is between -0.003 and 0.003
#17362 commented on Feb 11, 2025 • 0 new comments
ttnn.remainder unary low PCC when scalar is between -0.003 and 0.003
#17361 commented on Feb 11, 2025 • 0 new comments
Yolov11 - Model card
#13772 commented on Feb 11, 2025 • 0 new comments
Unit tests and models fail new TT_FATAL validation for sharding
#16948 commented on Feb 10, 2025 • 0 new comments
[Bug Report] captured_graph is missing buffer address on both L1 and DRAM
#16499 commented on Feb 10, 2025 • 0 new comments
SFPU shift operator issue when using sfpi
#15514 commented on Feb 10, 2025 • 0 new comments
Multidevice tensors do not work in comparison mode
#15363 commented on Feb 10, 2025 • 0 new comments
[Bug Report] `dprint_tensix_dest_reg` Bug
#17481 commented on Feb 10, 2025 • 0 new comments
[Feature Request] Support large tensor sizes in ttnn.conv2d
#17489 commented on Feb 10, 2025 • 0 new comments
Remove "Reach-Arounds" in TT-NN interfacing with TT-Metal
#17199 commented on Feb 10, 2025 • 0 new comments
[Feature Request] Light Metal Feature parent/tracking ticket
#17037 commented on Feb 10, 2025 • 0 new comments
Fix shape in outer
#17492 commented on Feb 7, 2025 • 0 new comments
Fix stored size of sharded buffers to match what device buffer expects
#17450 commented on Feb 5, 2025 • 0 new comments
#17218: Add output_dtype support for binary_ng
#17417 commented on Feb 6, 2025 • 0 new comments
Printing packer's and unpacker's configuration registers
#17368 commented on Feb 7, 2025 • 0 new comments
#16147: Replace binary with binary_ng
#17160 commented on Feb 7, 2025 • 0 new comments
Support parallelization over width for tilize with val padding
#17100 commented on Feb 5, 2025 • 0 new comments
Enable ConvMnist and Mnist integration and performance tests.
#16965 commented on Feb 6, 2025 • 0 new comments
#16888: Fix Conv2D when output is in Row Major
#16937 commented on Feb 6, 2025 • 0 new comments
Refactor llama3 demo to the new generator API
#16753 commented on Feb 7, 2025 • 0 new comments
#14080: Preprocess weights for Conv2D on Device
#16750 commented on Feb 6, 2025 • 0 new comments
[WIP] [TT-Train] TTNN Training
#16617 commented on Feb 6, 2025 • 0 new comments
Use NOC stream registers for signaling
#16558 commented on Feb 10, 2025 • 0 new comments
TTNN generic OP
#16546 commented on Feb 12, 2025 • 0 new comments
Fix typos.
#15365 commented on Feb 7, 2025 • 0 new comments
#14732: add bert-tiny test_performance using trace and 2cq-WIP
#14799 commented on Feb 7, 2025 • 0 new comments
#0: async
#11158 commented on Feb 12, 2025 • 0 new comments
Run CI tests in 20.04 docker
#12498 commented on Feb 12, 2025 • 0 new comments
Yolov7 Trace+2cq fails with Out of Memory issue
#17583 commented on Feb 12, 2025 • 0 new comments
[Bug Report] tt_simulation_device.cpp is always compiled.
#15161 commented on Feb 12, 2025 • 0 new comments
Add profiler-enabled wheel to release assets, despite it not working yet
#14301 commented on Feb 12, 2025 • 0 new comments
Upgrade CI runners to run Ubuntu 22.04 natively
#12492 commented on Feb 12, 2025 • 0 new comments
ttnn.neg, ttnn.abs, ttnn.selu and ttnn.identity give low pcc with sharded input
#16181 commented on Feb 7, 2025 • 0 new comments
ttnn.fill_implicit_tile_padding hangs for bfloat8_b
#17077 commented on Feb 7, 2025 • 0 new comments
FD ring buffer is stalling too often
#15221 commented on Feb 6, 2025 • 0 new comments
as_tensor fails when saving/loading a tensor with transposed tiles
#15496 commented on Feb 6, 2025 • 0 new comments
Request for SFPU LLKs for more flexible broadcasting
#16103 commented on Feb 6, 2025 • 0 new comments
ttnn.to_dtype conversion issue from bfloat8_b to bfloat16
#17159 commented on Feb 6, 2025 • 0 new comments
Add explicit BroadcastOp for TTNN
#16015 commented on Feb 6, 2025 • 0 new comments
Add retry loop when calling gh api during _produce_data.yaml to recover from rate limiting
#17374 commented on Feb 6, 2025 • 0 new comments
Remove default value for output operand (16) across BH LLK API calls
#15450 commented on Feb 6, 2025 • 0 new comments
[Ops] Support for Conv3d op (ttnn.Conv3d)
#15103 commented on Feb 6, 2025 • 0 new comments
[Feature Request] Support large tensor sizes in ttnn.group_norm
#17490 commented on Feb 6, 2025 • 0 new comments
REVERSE: Returns a tensor with the data reversed along the given axis.
#17116 commented on Feb 5, 2025 • 0 new comments
Blackhole: conv2d tests PCC failure when input channels = 16 (<32)
#16992 commented on Feb 5, 2025 • 0 new comments
Resnet50 on Blackhole: Optimizations
#17393 commented on Feb 5, 2025 • 0 new comments
TM Failures on BH
#17230 commented on Feb 5, 2025 • 0 new comments
PCC failure from `ttnn.moreh_norm` for non-last dim
#16335 commented on Feb 5, 2025 • 0 new comments
CPP Unit test MultiCommandQueueSingleDeviceFixture.TestMultiAppThreadSync hangs
#17345 commented on Feb 5, 2025 • 0 new comments
Async FD out of Eth cores on BH hang
#16643 commented on Feb 5, 2025 • 0 new comments
CCL Ops Test hang to be disabled
#17344 commented on Feb 5, 2025 • 0 new comments
repeat pytest hitting op assert
#14518 commented on Feb 5, 2025 • 0 new comments
[Feature Request] Improvement Needed for Unit Tests
#6633 commented on Feb 5, 2025 • 0 new comments
Tracy profiler on BH not working
#17099 commented on Feb 5, 2025 • 0 new comments
VMs losing communication with the GH server
#17240 commented on Feb 5, 2025 • 0 new comments
Make comparison mode work with fast runtime mode
#16762 commented on Feb 5, 2025 • 0 new comments
[Feature Request] Implement scatter communication mechanism on-device
#17314 commented on Feb 10, 2025 • 0 new comments
Missing interface - scatter
#16942 commented on Feb 10, 2025 • 0 new comments
Missing interface - gather
#16941 commented on Feb 10, 2025 • 0 new comments
Eltwise Master Tracking
#13795 commented on Feb 9, 2025 • 0 new comments
ttnn.maximum unsupported broadcast
#14852 commented on Feb 8, 2025 • 0 new comments
Matmul hang on BH
#16439 commented on Feb 8, 2025 • 0 new comments
sometimes on GS max tests fail when all tests in file are run
#17084 commented on Feb 8, 2025 • 0 new comments
Resnet50 on Blackhole: Using pre-trained data gives bad PCC
#17558 commented on Feb 8, 2025 • 0 new comments
[Bug Report] Matmul gives nondeterministic result
#17143 commented on Feb 7, 2025 • 0 new comments
Add handling in logs + artifacts download script for data collection for logs that don't exist
#12966 commented on Feb 7, 2025 • 0 new comments
Low PCC in LeNet Data Parallel with ttnn.reshape in TILE_LAYOUT.
#15422 commented on Feb 7, 2025 • 0 new comments
Fix CCL PCC error with Sharded Addrgen on disjointed core ranges
#17391 commented on Feb 7, 2025 • 0 new comments
Split TTNN into C++ library and Python binding
#16418 commented on Feb 7, 2025 • 0 new comments
Incorrect ttnn.linear result for activations with shape Mx1xN
#16599 commented on Feb 7, 2025 • 0 new comments
Llama3 model family - list of required ops for blackhole
#16013 commented on Feb 7, 2025 • 0 new comments
Multichip ops
#17246 commented on Feb 7, 2025 • 0 new comments
Stable diffusion 3.5 medium - Bring up
#15969 commented on Feb 7, 2025 • 0 new comments
Fabric EDM Optimization (to 10+ GB/s per direction):
#17423 commented on Feb 7, 2025 • 0 new comments
[Bug Report] reshape/permute incorrect outputs on multidevice
#17535 commented on Feb 7, 2025 • 0 new comments
[Master issue] Data pipeline and benchmarking infrastructure
#10718 commented on Feb 7, 2025 • 0 new comments
[Bug Report] TTNN typecast operation fails when the tensor is on host
#16279 commented on Feb 7, 2025 • 0 new comments
[Feature Request] Conv2d dram slicing
#17493 commented on Feb 7, 2025 • 0 new comments
Int32 support for subtract op
#16174 commented on Feb 7, 2025 • 0 new comments
Incorrect data from ttnn.from_torch for sharding
#15565 commented on Feb 7, 2025 • 0 new comments

Overview

4 Releases published by 1 person

127 Pull requests merged by 52 people

59 Pull requests opened by 41 people

69 Issues closed by 36 people

83 Issues opened by 49 people

97 Unresolved conversations