chromium / external / github.com / google / XNNPACK / HEAD
5df1362
Mark `binary_test` with `timeout = "moderate"` to avoid flakiness due to timeouts.
by Pedro Gonnet
· 2 hours ago
upstream/master
b58f508
Add `slice_dim0` helper
by Dillon Sharlet
· 3 hours ago
d06fc11
Merge pull request #9935 from wangw-1991:fix_bugs
by XNNPACK Team
· 3 hours ago
ac9bc35
Adds missing GEMMBenchmark overload for qp8_f32_qc8w_gemm_minmax and corrects qp8_f32_qc4w microkernels and associated functions to use xnn_f32_qc4w_minmax_params instead of xnn_f32_minmax_params.
by Samuel Fuller
· 3 hours ago
704a6d7
Demote some logging for subgraph rewrites from `xnn_log_info` to `xnn_log_debug`.
by Pedro Gonnet
· 3 hours ago
c9e961c
Initial upload.
by Wei Wang
· 13 hours ago
7f56155
Minor simplifications of static_broadcast nodes
by Dillon Sharlet
· 20 hours ago
9344bcb
Give slinky our assumptions to reduce overhead
by Dillon Sharlet
· 22 hours ago
7a55567
Update slinky in XNNPACK
by Dillon Sharlet
· 22 hours ago
eed1807
Handle identity casts for scalar vectors in simd wrappers
by Volodymyr Kysenko
· 23 hours ago
f4ca248
Add fp64 reduce support
by Dillon Sharlet
· 23 hours ago
211341d
Add fp64 support to simd::vec for x86 and ARM64
by Dillon Sharlet
· 25 hours ago
chromium/7784
1b30163
Merge pull request #9858 from GregoryComer:bf16-rminmax-scalar
by XNNPACK Team
· 2 days ago
a902cb3
Merge pull request #9879 from keaganchern:rvv-f32-vmulcaddc
by XNNPACK Team
· 2 days ago
88a78ab
Add subgraph rewrite for `x/sqrt(y)` to `x*reciprocal_square_root(y)`
by Dillon Sharlet
· 2 days ago
7ef819c
updating kernel and testing infra
by kchern2@berkeley.edu
· 2 days ago
c7903c9
Add WASM SIMD128 kernel for dequantize_dot.
by Volodymyr Kysenko
· 2 days ago
f84ade5
Remove formatting check from gn build
by Dillon Sharlet
· 2 days ago
fcdddfb
Don't rewrite dequantize_dot if there is no optimized kernel.
by Volodymyr Kysenko
· 2 days ago
d6097c2
Avoid `raw_buffer::cast` in dynamic quantization implementation
by Dillon Sharlet
· 2 days ago
37d77e5
[gn] fix: do not cache Git repos
by Richard Townsend
· 2 days ago
c1bff3f
Fix mismatching function signatures.
by Volodymyr Kysenko
· 2 days ago
925f0e9
Add WASM SIMD128 int8xint8->int32 dot product kernels.
by Volodymyr Kysenko
· 2 days ago
6c008cf
Optimize the subgraph glue code for reduce kernels
by Dillon Sharlet
· 2 days ago
24332a0
Adjust sharding of reduce kernel tests
by Dillon Sharlet
· 3 days ago
f82740c
f16_vexp and f16_vapproxgelu - remove arm fp16 requirement from scalar microkernel tests
by Frank Barchard
· 3 days ago
25b9b12
f16_vgelu - remove arm fp16 requirement from scalar microkernel tests
by Frank Barchard
· 3 days ago
6938f34
Fixes to enable Hexagon to pass continuous testing
by Dillon Sharlet
· 3 days ago
762ae7c
Relax tolerances of `dequantize_dot` test
by Dillon Sharlet
· 3 days ago
ea6c92e
Set WASM parameters for threading and update some of the tests accordingly.
by Volodymyr Kysenko
· 3 days ago
24d51bb
Set stack sizes for WASM.
by Volodymyr Kysenko
· 3 days ago
4b71166
Merge pull request #9903 from ken-unger:rvv-fp16-detect-fix
by XNNPACK Team
· 3 days ago
e2b2b9d
Fix F16_VGELU unittest crash on ARM
by Frank Barchard
· 3 days ago
6b30acd
Add RMS norm benchmark
by Dillon Sharlet
· 3 days ago
b5bdf59
fix comment
by Ken Unger
· 3 days ago
225c8e8
fix RISCV_HWPROBE_EXT_ZVFH when not defined
by Ken Unger
· 3 days ago
d0a0412
Actually use WASM SIMD128 FP64 dot kernels by including them in kernel.inc
by Volodymyr Kysenko
· 3 days ago
202cf6e
added additional branching for vmulcaddc to prevent crashing
by kchern2@berkeley.edu
· 3 days ago
5872ba3
Fix for MSVC build - remove avx2 from fma3 header.
by Frank Barchard
· 3 days ago
3f281b3
Remove capture of variables by reference across threads.
by Marie White
· 4 days ago
5ab29b0
Add more m=1 kernels
by Dillon Sharlet
· 4 days ago
aee6b1b
Add FP16 detect for RVV
by Frank Barchard
· 4 days ago
5ab74a4
Print ccache stats for Windows builds.
by Alexander Shaposhnikov
· 4 days ago
79f8208
Use `fuse_and_slice_leading_dims` + `for_each_element` for `compute_qd8_params`
by Dillon Sharlet
· 4 days ago
48365ce
Optimize `fuse_and_slice_leading_dims`
by Dillon Sharlet
· 4 days ago
e238480
Add m = 1 kernel for tile_k = 16 dot kernels
by Dillon Sharlet
· 4 days ago
80bc38c
Add fp64 dot kernels for wasm simd.
by Volodymyr Kysenko
· 4 days ago
b5e60c2
Merge pull request #9693 from ken-unger:unary-rvv
by XNNPACK Team
· 4 days ago
9db1fb7
Add benchmark of `fuse_and_slice_leading_dims`
by Dillon Sharlet
· 4 days ago
0dc6aba
Replace `broadcast_or_load` with specialization for `offset` in `dequantize_dot`
by Dillon Sharlet
· 4 days ago
802acce
Add WASM SIMD128 FP32 dot product kernels.
by Volodymyr Kysenko
· 4 days ago
76f79df
Add a `dequantize_dot` kernel
by Dillon Sharlet
· 4 days ago
e4ae519
Make wasm simd128 arch name consistent with others.
by Volodymyr Kysenko
· 4 days ago
ad25fa0
Add variable-length RVV f32-vmulcaddc-minmax microkernel
by kchern2@berkeley.edu
· 4 days ago
6293fad
Fix test:buffer_test asan failure
by Frank Barchard
· 4 days ago
b095cc6
Remove F32 ReLU GEMM/IGEMM microkernels
by Frank Barchard
· 4 days ago
db167df
Minor cleanups
by Dillon Sharlet
· 4 days ago
5ca7819
Add scalar parameters for more unary kernels
by Dillon Sharlet
· 4 days ago
c60479f
Merge branch 'google:master' into unary-rvv
by Ken Unger
· 4 days ago
de3fca8
adding f32-vmulcaddc rvv microkernel
by kchern2@berkeley.edu
· 5 days ago
9263081
Fix neonsme2 warning script tools/generate-gemm-test.py
by Frank Barchard
· 7 days ago
5407a22
Run generators to update RVV microkernels
by Frank Barchard
· 7 days ago
8c540e4
Refactor `schedule_dot` and `run_dot` to not capture parameters of A and B
by Dillon Sharlet
· 7 days ago
4453b06
Refactor Dockerfile.standard and add riscv64 container build
by Alexander Shaposhnikov
· 7 days ago
6d11164
GEMM test generator fix for linear and relu - set output to input if not set
by Frank Barchard
· 7 days ago
3af7056
Remove whitespace from rvv qd8 gemm
by Frank Barchard
· 7 days ago
353d065
Strengthen fusion of constant arithmetic
by Dillon Sharlet
· 7 days ago
8bfc09d
Add benchmark support for f32-bf16-vcvt
by Gregory Comer
· 7 days ago
ffba580
Add transpose and interleave kernels for wasm simd128.
by Volodymyr Kysenko
· 7 days ago
7019283
Add interleave simd wrappers for wasm.
by Volodymyr Kysenko
· 7 days ago
3582cbd
Implement approxgelu in YNNPACK
by Dillon Sharlet
· 7 days ago
b8455c9
Add WASM SIMD128 kernels for FP32 sum and sum of squares.
by Volodymyr Kysenko
· 7 days ago
43df024
Add transpose and its supporting functions to wasm simd wrappers.
by Volodymyr Kysenko
· 7 days ago
9dd4192
Add min, max, minmax reduce kernels for wasm.
by Volodymyr Kysenko
· 7 days ago
8c9c00f
Add explicit dimensions for transposed A
by Dillon Sharlet
· 7 days ago
319c384
Add horizontal_min/horizontal_max implementation to wasm simd wrappers
by Volodymyr Kysenko
· 7 days ago
fd746bf
Add missing min/max overloads for WASM SIMD.
by Volodymyr Kysenko
· 7 days ago
a2d8b20
Add params to `erf` kernel and use them to implement `gelu`. Mostly AI generated, here is what AI had to say:
by Dillon Sharlet
· 7 days ago
9cd282f
Fix discrepancy in a_stride_m/a_stride_k for transposed dot kernels
by Dillon Sharlet
· 7 days ago
dfa0f4b
Add support for passing scalar parameters to elementwise kernels.
by Volodymyr Kysenko
· 8 days ago
e4a3e71
Update bazel build for bf16 minmax kernels
by Gregory Comer
· 8 days ago
4f95611
Remove unnecessary convert after quantized dots with float outputs
by Dillon Sharlet
· 8 days ago
f086008
Add scalar bf16-rminmax microkernels, configs, and tests
by Gregory Comer
· 8 days ago
015e3e6
Add `ynn_define_slice_like` and use it to implement `xnn_define_rope`
by Dillon Sharlet
· 8 days ago
94a13a8
Merge pull request #9844 from yolanda15:qd8_update_simd
by XNNPACK Team
· 8 days ago
3ef1548
Fix warning on some builds
by Dillon Sharlet
· 9 days ago
cbd072c
Merge pull request #9846 from yolanda15:qd8_update_sdot
by XNNPACK Team
· 9 days ago
8ebc71c
Merge pull request #9812 from ken-unger:f16-softmax2-rvv
by XNNPACK Team
· 9 days ago
3b9ec79
Merge pull request #9727 from GregoryComer:f32-bf16-vcvt-scalar
by XNNPACK Team
· 9 days ago
f3976a3
Update qd8 gemm config to enable c4 microkernels for wasmsdot
by Yolanda Chen
· 9 days ago
cf89f69
Update qd8 gemm config to enable 4x16c2s2 microkernels for wasm simd
by Yolanda Chen
· 10 days ago
e0fe33a
Merge pull request #9753 from yolanda15:qd8_update_usdot
by XNNPACK Team
· 10 days ago
1812bbe
Merge pull request #9833 from mohammadmseet-hue:fix/bmm-num-batch-dims-check
by XNNPACK Team
· 10 days ago
chromium/7767
chromium/7768
chromium/7769
chromium/7770
chromium/7771
chromium/7772
chromium/7773
chromium/7774
chromium/7775
chromium/7776
chromium/7777
chromium/7778
chromium/7778_2
chromium/7779
chromium/7780
chromium/7781
chromium/7782
chromium/7783
0304f05
Fix incorrectly disabled benchmarks and tests
by Dillon Sharlet
· 10 days ago
d0a338b
Fix warnings on some builds
by Dillon Sharlet
· 10 days ago
856538e
Update build_srcs.bzl for bf16 scalar cvt
by Gregory Comer
· 10 days ago
e0de998
Add notap tag to manual targets
by Dillon Sharlet
· 10 days ago
28e1a13
Fix stack buffer overflow in batch matrix multiply reshape
by mohammadmseet-hue
· 10 days ago
3d7a7d6
Fix stack buffer overflow in static_slice and missing bounds checks in copy ops
by mohammadmseet-hue
· 10 days ago
beea6f7
Remove redundant dim bounds checks
by Dillon Sharlet
· 10 days ago