Skip to content

Actions: huggingface/text-generation-inference

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
16,151 workflow runs
16,151 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Upload PR Documentation
Upload PR Documentation #176: completed by drbh
December 18, 2024 14:59 30s
December 18, 2024 14:59 30s
Improve vlm support (add idefics3 support)
Build PR Documentation #289: Pull request #2437 synchronize by drbh
December 18, 2024 14:58 47s improve-vlm-support
December 18, 2024 14:58 47s
Improve vlm support (add idefics3 support)
CI build #2066: Pull request #2437 synchronize by drbh
December 18, 2024 14:58 11h 5m 10s improve-vlm-support
December 18, 2024 14:58 11h 5m 10s
Improve vlm support (add idefics3 support)
Server Tests #3584: Pull request #2437 synchronize by drbh
December 18, 2024 14:58 8m 52s improve-vlm-support
December 18, 2024 14:58 8m 52s
Improve vlm support (add idefics3 support)
Automatic Documentation for Launcher #1956: Pull request #2437 synchronize by drbh
December 18, 2024 14:58 8m 25s improve-vlm-support
December 18, 2024 14:58 8m 25s
Improve vlm support (add idefics3 support)
Nix Tests #712: Pull request #2437 synchronize by drbh
December 18, 2024 14:58 7m 52s improve-vlm-support
December 18, 2024 14:58 7m 52s
fix: improve text model loading
Secret Leaks #2577: Commit 064e040 pushed by drbh
December 18, 2024 14:58 18s improve-vlm-support
December 18, 2024 14:58 18s
Add fp8 kv cache for ROCm
CI build #2065: Pull request #2856 opened by mht-sharma
December 18, 2024 14:56 1d 11h 28m 21s fp8_kvcache_rocm
December 18, 2024 14:56 1d 11h 28m 21s
Add fp8 kv cache for ROCm
Automatic Documentation for Launcher #1955: Pull request #2856 opened by mht-sharma
December 18, 2024 14:56 7m 40s fp8_kvcache_rocm
December 18, 2024 14:56 7m 40s
Add fp8 kv cache for ROCm
Server Tests #3583: Pull request #2856 opened by mht-sharma
December 18, 2024 14:56 8m 44s fp8_kvcache_rocm
December 18, 2024 14:56 8m 44s
Add fp8 kv cache for ROCm
Nix Tests #711: Pull request #2856 opened by mht-sharma
December 18, 2024 14:56 8m 14s fp8_kvcache_rocm
December 18, 2024 14:56 8m 14s
add fp8 kv cache for rocm
Secret Leaks #2576: Commit fa14d71 pushed by mht-sharma
December 18, 2024 14:56 18s fp8_kvcache_rocm
December 18, 2024 14:56 18s
misc(backend): lets try...
Secret Leaks #2575: Commit 6497964 pushed by mfuntowicz
December 18, 2024 14:18 20s trtllm/ci
December 18, 2024 14:18 20s
misc(backend): lets try...
Build TensorRT-LLM #44: Commit 6497964 pushed by mfuntowicz
December 18, 2024 14:18 6m 42s trtllm/ci
December 18, 2024 14:18 6m 42s
Add Flash decoding kernel ROCm
Automatic Documentation for Launcher #1954: Pull request #2855 opened by mht-sharma
December 18, 2024 12:50 7m 19s flash_decoding_rocm
December 18, 2024 12:50 7m 19s
Add Flash decoding kernel ROCm
CI build #2064: Pull request #2855 opened by mht-sharma
December 18, 2024 12:50 1d 13h 34m 25s flash_decoding_rocm
December 18, 2024 12:50 1d 13h 34m 25s
Add Flash decoding kernel ROCm
Nix Tests #710: Pull request #2855 opened by mht-sharma
December 18, 2024 12:50 5m 53s flash_decoding_rocm
December 18, 2024 12:50 5m 53s
Add Flash decoding kernel ROCm
Server Tests #3582: Pull request #2855 opened by mht-sharma
December 18, 2024 12:50 8m 14s flash_decoding_rocm
December 18, 2024 12:50 8m 14s
Merge branch 'main' into flash_decoding_rocm
Secret Leaks #2574: Commit 8936a03 pushed by mht-sharma
December 18, 2024 12:45 16s flash_decoding_rocm
December 18, 2024 12:45 16s
Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm
CI build #2063: Pull request #2825 synchronize by mht-sharma
December 18, 2024 12:15 1d 14h 9m 17s rocm-fp8-tensorwise
December 18, 2024 12:15 1d 14h 9m 17s
Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm
Nix Tests #709: Pull request #2825 synchronize by mht-sharma
December 18, 2024 12:15 6m 27s rocm-fp8-tensorwise
December 18, 2024 12:15 6m 27s
Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm
Automatic Documentation for Launcher #1953: Pull request #2825 synchronize by mht-sharma
December 18, 2024 12:15 11m 30s rocm-fp8-tensorwise
December 18, 2024 12:15 11m 30s
Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm
Server Tests #3581: Pull request #2825 synchronize by mht-sharma
December 18, 2024 12:15 7m 0s rocm-fp8-tensorwise
December 18, 2024 12:15 7m 0s
nm changes
Secret Leaks #2573: Commit f8771d0 pushed by mht-sharma
December 18, 2024 12:15 25s rocm-fp8-tensorwise
December 18, 2024 12:15 25s
Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm
CI build #2062: Pull request #2825 synchronize by mht-sharma
December 18, 2024 12:05 19m 22s rocm-fp8-tensorwise
December 18, 2024 12:05 19m 22s