Skip to content

Pull requests: ggerganov/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Add Fedora CUDA Guide for Development in Toolbox Environment documentation Improvements or additions to documentation
#11135 opened Jan 8, 2025 by teihome Loading…
vulkan: optimize coopmat2 q2_k dequant function ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#11130 opened Jan 7, 2025 by jeffbolznv Loading…
SYCL: Refactor ggml_sycl_compute_forward ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#11121 opened Jan 7, 2025 by qnixsynapse Loading…
llama : functions -> methods devops improvements to build systems and github actions
#11110 opened Jan 6, 2025 by ggerganov Draft
1 of 2 tasks
feat(ci): add visionOS build workflow devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning
#11103 opened Jan 6, 2025 by ggerganov Loading…
vulkan: scale caching for k quants + misc fixes ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#11081 opened Jan 5, 2025 by netrunnereve Loading…
Remove obsolete HIP workaround build Compilation issues devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning nix Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment Nvidia GPU Issues specific to Nvidia GPUs
#11080 opened Jan 5, 2025 by sARY77 Loading…
feat(ci): add visionOS build workflow devops improvements to build systems and github actions
#11065 opened Jan 3, 2025 by sinkingsugar Loading…
llama : remove notion of CLS token python python script changes
#11064 opened Jan 3, 2025 by ggerganov Loading…
android : Apply chat template android Issues specific to Android examples
#11059 opened Jan 3, 2025 by Dhruvanand24 Loading…
CUDA Graph Compute Function Refactor (precursor for performance improvements) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#11042 opened Jan 2, 2025 by aendk Draft
Add VisionOS compatibility by adding missing type definitions Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#11019 opened Dec 30, 2024 by sinkingsugar Loading…
Add Jinja template support examples python python script changes script Script related server
#11016 opened Dec 30, 2024 by ochafik Draft
3 tasks
model: Add support for PhiMoE arch documentation Improvements or additions to documentation enhancement New feature or request model Model specific python python script changes
#11003 opened Dec 28, 2024 by phymbert Loading…
Add support for QRWKV6 hybrid models & slight optimization for RWKV6 ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#11001 opened Dec 28, 2024 by MollySophia Loading…
Vulkan: Destroy Vulkan instance on exit ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#10989 opened Dec 26, 2024 by 0cc4m Loading…
server : (web ui) Enable gzip compression for local storage demo Demonstrate some concept or idea, not intended to be merged examples server
#10945 opened Dec 22, 2024 by exxocism Loading…
2 tasks
Allow user to compile with any cuda version using github actions devops improvements to build systems and github actions
#10928 opened Dec 21, 2024 by jianlins Loading…
ASCII/Romanization for OuteTTS Multilingual Processing demo Demonstrate some concept or idea, not intended to be merged examples
#10894 opened Dec 19, 2024 by edwko Loading…
llama: Ensure KV cache is fully defragmented.
#10873 opened Dec 17, 2024 by jessegross Loading…
SYCL: Fixes for building SYCL backend for AMD GPUs documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#10851 opened Dec 16, 2024 by lhl Loading…
Fix compilation on Pop!_OS 22.04 LTS CUDA ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#10835 opened Dec 15, 2024 by mika314 Loading…
ProTip! Exclude everything labeled bug with -label:bug.