# of sequential models (e.g., Transformers) or neural networks with small batch size.
# It takes a vector :math:`x` as input and produces a vector :math:`y` of the same shape as output.
# The ...
register_cuda_ci(est_time=125, suite="stage-b-kernel-unit-1-gpu-large")
register_cuda_ci(est_time=500, suite="nightly-kernel-1-gpu", nightly=True)
# JIT rmsnorm: fp16 ...
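The test registration above names an rmsnorm kernel, but the snippet is truncated before any details. As a point of reference only, here is a minimal pure-Python sketch of RMS normalization, assuming the commonly used definition `y = x / sqrt(mean(x^2) + eps) * weight`. The function name `rms_norm`, the `weight` parameter, and the `eps` default are illustrative assumptions and are not taken from the source.

```python
import math


def rms_norm(x, weight=None, eps=1e-6):
    """Reference RMS normalization of a 1-D list of floats.

    Hypothetical helper for illustration; not the kernel under test.
    """
    if weight is None:
        # Assumption: no learned scale, so use an all-ones weight.
        weight = [1.0] * len(x)
    # Mean of squares over the vector.
    mean_sq = sum(v * v for v in x) / len(x)
    # Reciprocal root-mean-square, with eps for numerical stability.
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    # The output has the same length as the input.
    return [v * inv_rms * w for v, w in zip(x, weight)]


y = rms_norm([3.0, 4.0])
```

With an all-ones weight, the output keeps the input's shape and its mean square is approximately 1, which is a convenient property to check when comparing a fused kernel against a reference.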