bf16 scale/bias for INT4 #5595
Closed
jeetkanjani7 wants to merge 1 commit into pytorch:main
Conversation
Contributor
@jeetkanjani7 has exported this pull request. If you are a Meta employee, you can view the originating Diff in D95859348.
jeetkanjani7 force-pushed from 962a486 to c9a4ef8
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 13, 2026
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 13, 2026
jeetkanjani7 force-pushed from c9a4ef8 to 1b262b9
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 13, 2026
jeetkanjani7 force-pushed from 39d74bc to b65c26e
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 13, 2026
jeetkanjani7 force-pushed from b65c26e to f222b33
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 21, 2026
jeetkanjani7 force-pushed from f222b33 to 1a0f21b
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 23, 2026
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 23, 2026
jeetkanjani7 force-pushed from eb5b190 to d0aa8bb
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 24, 2026
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 24, 2026
jeetkanjani7 force-pushed from d0aa8bb to ec9a50e
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 24, 2026
jeetkanjani7 force-pushed from ec9a50e to a98dd37
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 24, 2026
jeetkanjani7 force-pushed from a98dd37 to 1eef2bd
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 24, 2026
jeetkanjani7 force-pushed from 1eef2bd to f5f727c
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 24, 2026
jeetkanjani7 force-pushed from f5f727c to 3abc3b5
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 24, 2026
jeetkanjani7 force-pushed from 3abc3b5 to 4b75cba
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 24, 2026
jeetkanjani7 force-pushed from 4db39c2 to 3c73db9
jeetkanjani7 added a commit to jeetkanjani7/FBGEMM-1 that referenced this pull request on Apr 27, 2026
jeetkanjani7 force-pushed from 3c73db9 to 1cc0b5c
Contributor
This pull request has been merged in 939f2da.
This pull request has been reverted by 5e1dde6.
Summary:
Add bf16 scale/bias support for INT4/INT2 fused N-bit rowwise quantization in FBGEMM and SilverTorch.
Previously, fused N-bit rowwise quantization supported only fp16 scale/bias; storing scale/bias in bf16 avoids the precision loss that fp16 truncation introduces during quantization round-trips (a short sketch below illustrates the difference).
X-link: https://github.com/facebookresearch/FBGEMM/pull/2551
Reviewed By: zhaozhul
Differential Revision: D95859348
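
To see why the storage dtype of the scale/bias matters, here is a minimal, hypothetical sketch in plain PyTorch (not FBGEMM's actual fused kernel or API; the example scale values are made up). fp16's 5-bit exponent pushes small per-row scales into its subnormal range, where conversion can truncate several percent of the value, while bf16 shares fp32's 8-bit exponent, so the same scales stay normal and the relative error is bounded by its 7-bit mantissa (roughly 0.4%):

```python
import torch

# Hypothetical per-row scales, e.g. from quantizing low-magnitude rows.
# 4.0e-7 falls in fp16's subnormal range (fp16's smallest normal is ~6.1e-5),
# so storing it as fp16 rounds it to a multiple of 2^-24 (~5.96e-8).
scales = torch.tensor([4.0e-7, 3.2e-5, 1.0e-2], dtype=torch.float32)

# Round-trip through each candidate storage dtype, as a fused rowwise
# format would when writing scale/bias next to the packed INT4 payload.
via_fp16 = scales.to(torch.float16).float()
via_bf16 = scales.to(torch.bfloat16).float()

rel_err = lambda stored: (stored - scales).abs() / scales
print("fp16 storage rel. error:", rel_err(via_fp16))  # ~4% on the 4.0e-7 entry
print("bf16 storage rel. error:", rel_err(via_bf16))  # <0.4% on every entry
```

Because dequantization computes q * scale + bias, any error in the stored scale multiplies every dequantized element in that row, so even a few percent of truncation on the scale is visible after an INT4 round-trip.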