Commits: examples/quantize-stats/quantize-stats.cpp - ggml-org/llama.cpp - Morph

ggml-org / llama.cpp

LLM inference in C/C++

COMMITS

/ examples/quantize-stats/quantize-stats.cpp

November 14, 2024

ggml : build backends as libraries (#10256)

Diego Devesa committed 1y ago

October 8, 2024

ggml : fix BLAS with unsupported types (#9775)

Diego Devesa committed 1y ago

September 7, 2024

llama : refactor sampling v2 (#9294)

Georgi Gerganov committed 1y ago

July 12, 2024

ggml : minor naming changes (#8433)

Georgi Gerganov committed 1y ago

April 29, 2024

Improve usability of --model-url & related flags (#6930)

Olivier Chafik committed 1y ago

February 3, 2024

refactor : switch to emplace_back to avoid extra object (#5291)

Michael Klimenko committed 2y ago

January 30, 2024

SOTA 3-bit quants (#5196)

Kawrakow committed 2y ago

December 7, 2023

llama : per-layer KV cache + quantum K cache (#4309)

Georgi Gerganov committed 2y ago

November 2, 2023

build : link against build info instead of compiling against it (#3879)

cebtenzzre committed 2y ago

September 28, 2023

llama.cpp : split llama_context_params into model and context params (#3301)

slaren committed 2y ago

September 18, 2023

make : restore build-info.h dependency for several targets (#3205)

Cebtenzzre committed 2y ago

September 15, 2023

examples : add compiler version and target to build info (#2998)

Cebtenzzre committed 2y ago

check C++ code with -Wmissing-declarations (#3184)

Cebtenzzre committed 2y ago

September 7, 2023

fix some warnings from gcc and clang-tidy (#3038)

Cebtenzzre committed 2y ago

August 21, 2023

gguf : new file format with flexible meta data (beta) (#2398)

Georgi Gerganov committed 2y ago

July 5, 2023

ggml : generalize `quantize_fns` for simpler FP16 handling (#1237)

Stephan Walter committed 2y ago

June 24, 2023

llama : make model stateless and context stateful (llama_state) (#1797)

Didzis Gosko committed 2y ago

June 16, 2023

build : fix and ignore MSVC warnings (#1889)

Borislav Stanimirov committed 2y ago

June 5, 2023

ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)

Kawrakow committed 2y ago

May 17, 2023

Remove unused n_parts parameter (#1509)

Stephan Walter committed 2y ago

May 1, 2023

Add git-based build information for better issue tracking (#1232)

DannyDaemonic committed 2y ago

April 20, 2023

llama : multi-threaded quantization (#1075)

Kawrakow committed 2y ago

April 17, 2023

quantize-stats : fix bug in --type argument

Georgi Gerganov committed 3y ago

April 14, 2023

Expose type name from ggml (#970)

Pavol Rusnak committed 3y ago

April 13, 2023

llama : merge llama_internal.h into llama.h

Georgi Gerganov committed 3y ago

April 8, 2023

Rewrite loading code to try to satisfy everyone:

comex committed 3y ago

April 7, 2023

Add quantize-stats command for testing quantization (#728)

unbounded committed 3y ago