Skip to content

Actions: huggingface/text-generation-inference

Automatic Documentation for Launcher

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,843 workflow runs
1,843 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Enable qwen2vl video
Automatic Documentation for Launcher #1969: Pull request #2756 synchronize by drbh
December 23, 2024 18:47 7m 23s enable-qwen2vl-video
December 23, 2024 18:47 7m 23s
Improve vlm support (add idefics3 support)
Automatic Documentation for Launcher #1968: Pull request #2437 synchronize by drbh
December 23, 2024 14:40 7m 20s improve-vlm-support
December 23, 2024 14:40 7m 20s
docs(conceptual/speculation): available links Train Medusa
Automatic Documentation for Launcher #1967: Pull request #2863 opened by guspan-tanadi
December 23, 2024 05:15 Action required guspan-tanadi:linksection
December 23, 2024 05:15 Action required
Basic flashinfer 0.2 support
Automatic Documentation for Launcher #1966: Pull request #2862 synchronize by danieldk
December 22, 2024 13:04 7m 12s flashinfer-0.2
December 22, 2024 13:04 7m 12s
Basic flashinfer 0.2 support
Automatic Documentation for Launcher #1965: Pull request #2862 opened by danieldk
December 22, 2024 12:24 7m 8s flashinfer-0.2
December 22, 2024 12:24 7m 8s
Improve vlm support (add idefics3 support)
Automatic Documentation for Launcher #1964: Pull request #2437 synchronize by drbh
December 21, 2024 00:27 7m 11s improve-vlm-support
December 21, 2024 00:27 7m 11s
Fix docker run in README.md
Automatic Documentation for Launcher #1963: Pull request #2861 opened by alvarobartt
December 20, 2024 10:22 7m 12s fix-docker-run-readme
December 20, 2024 10:22 7m 12s
Flash decoding kernel adding and prefill-chunking and prefix caching enabling in intel cpu/xpu
Automatic Documentation for Launcher #1962: Pull request #2815 synchronize by sywangyi
December 20, 2024 04:53 Action required sywangyi:flash_decoding
December 20, 2024 04:53 Action required
fix: include add_special_tokens in kserve request
Automatic Documentation for Launcher #1961: Pull request #2859 opened by drbh
December 19, 2024 21:54 7m 10s kserve-request-patch
December 19, 2024 21:54 7m 10s
Efficient Transformers backend support
Automatic Documentation for Launcher #1960: Pull request #2858 synchronize by Cyrilvallez
December 19, 2024 17:49 7m 13s Cyrilvallez:transformers-backend
December 19, 2024 17:49 7m 13s
Efficient Transformers backend support
Automatic Documentation for Launcher #1959: Pull request #2858 opened by Cyrilvallez
December 19, 2024 17:47 7m 21s Cyrilvallez:transformers-backend
December 19, 2024 17:47 7m 21s
Flash decoding kernel adding and prefill-chunking and prefix caching enabling in intel cpu/xpu
Automatic Documentation for Launcher #1958: Pull request #2815 synchronize by sywangyi
December 19, 2024 11:21 Action required sywangyi:flash_decoding
December 19, 2024 11:21 Action required
Improve vlm support (add idefics3 support)
Automatic Documentation for Launcher #1957: Pull request #2437 synchronize by drbh
December 19, 2024 01:54 7m 1s improve-vlm-support
December 19, 2024 01:54 7m 1s
Improve vlm support (add idefics3 support)
Automatic Documentation for Launcher #1956: Pull request #2437 synchronize by drbh
December 18, 2024 14:58 8m 25s improve-vlm-support
December 18, 2024 14:58 8m 25s
Add fp8 kv cache for ROCm
Automatic Documentation for Launcher #1955: Pull request #2856 opened by mht-sharma
December 18, 2024 14:56 7m 40s fp8_kvcache_rocm
December 18, 2024 14:56 7m 40s
Add Flash decoding kernel ROCm
Automatic Documentation for Launcher #1954: Pull request #2855 opened by mht-sharma
December 18, 2024 12:50 7m 19s flash_decoding_rocm
December 18, 2024 12:50 7m 19s
Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm
Automatic Documentation for Launcher #1953: Pull request #2825 synchronize by mht-sharma
December 18, 2024 12:15 11m 30s rocm-fp8-tensorwise
December 18, 2024 12:15 11m 30s
Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm
Automatic Documentation for Launcher #1952: Pull request #2825 synchronize by mht-sharma
December 18, 2024 12:05 8m 9s rocm-fp8-tensorwise
December 18, 2024 12:05 8m 9s
Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm
Automatic Documentation for Launcher #1951: Pull request #2825 synchronize by mht-sharma
December 18, 2024 10:50 7m 20s rocm-fp8-tensorwise
December 18, 2024 10:50 7m 20s
Add possible variants for A100 and H100 GPUs for auto-detecting flops
Automatic Documentation for Launcher #1950: Pull request #2837 synchronize by lazariv
December 18, 2024 08:40 Action required lazariv:main
December 18, 2024 08:40 Action required
Improve vlm support (add idefics3 support)
Automatic Documentation for Launcher #1949: Pull request #2437 synchronize by drbh
December 18, 2024 05:06 7m 34s improve-vlm-support
December 18, 2024 05:06 7m 34s
Improve vlm support (add idefics3 support)
Automatic Documentation for Launcher #1948: Pull request #2437 synchronize by drbh
December 18, 2024 03:25 7m 17s improve-vlm-support
December 18, 2024 03:25 7m 17s
Enable qwen2vl video
Automatic Documentation for Launcher #1947: Pull request #2756 synchronize by drbh
December 18, 2024 01:41 9m 57s enable-qwen2vl-video
December 18, 2024 01:41 9m 57s
Improve vlm support (add idefics3 support)
Automatic Documentation for Launcher #1946: Pull request #2437 synchronize by drbh
December 18, 2024 01:36 8m 15s improve-vlm-support
December 18, 2024 01:36 8m 15s
Improve vlm support (add idefics3 support)
Automatic Documentation for Launcher #1945: Pull request #2437 synchronize by drbh
December 17, 2024 19:44 7m 7s improve-vlm-support
December 17, 2024 19:44 7m 7s