Low bit quantization type #84

Open

opened

on Dec 28, 2024

As LLM becomes larger and larger, some highly quantized types have emerged in recent years, such as FP8, Q4.

Should tensor-type add these definitions, or provide custom(string) enumeration

Metadata

Assignees

No one assigned

Labels

No labels

No labels

Type

No type

Fields

No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests