-
## Type of issue
- Thanks guys for this awesome work. I was curious to run llama3-8B on my personal CPU, and the performance is quite impressive (nearly 2x the speed of llama.cpp for the same model size on the same hardware).
…
-
# Bitnet 1.58 Groundwork
After some talks with Saroufim and the CUDA MODE team working on BitNet, we've outlined a strategy for implementing the BitNet 1.58 method in torch. This issue lays the groun…
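As a concrete starting point on the storage side, here is a minimal sketch (not from the issue itself) of packing ternary {-1, 0, +1} weights at 2 bits each, four values per `uint8` — the kind of dtype groundwork torch would need. The helper names are hypothetical, and it assumes the tensor's element count is a multiple of 4:

```python
import torch

def pack_ternary(w_q: torch.Tensor) -> torch.Tensor:
    # Map {-1, 0, +1} -> {0, 1, 2}, then pack four 2-bit values per uint8.
    # Assumes w_q.numel() is a multiple of 4.
    u = (w_q + 1).to(torch.uint8).flatten().reshape(-1, 4)
    return u[:, 0] | (u[:, 1] << 2) | (u[:, 2] << 4) | (u[:, 3] << 6)

def unpack_ternary(packed: torch.Tensor) -> torch.Tensor:
    # Inverse: recover a flat float tensor with values in {-1, 0, +1}.
    cols = [((packed >> s) & 3) for s in (0, 2, 4, 6)]
    return torch.stack(cols, dim=1).flatten().float() - 1.0

# Example: four ternary weights fit into a single byte.
packed = pack_ternary(torch.tensor([-1.0, 0.0, 1.0, 1.0]))
assert unpack_ternary(packed).tolist() == [-1.0, 0.0, 1.0, 1.0]
```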
-
First of all: CONGRATS ON YOUR AMAZING RESEARCH WORK.
Considering that this is using GGML and seems based directly on `llama.cpp`:
Why is this a separate project from `llama.cpp`, given that `llama.c…
-
Some of the most popular models provide weights in bfloat16, which unfortunately cannot be loaded on the CPU because `Matmul::eval_cpu` only supports float32.
I know CPU support is not a priority, but it …
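As a stopgap on the user side, the checkpoint can be upcast offline before loading. A minimal PyTorch sketch, with hypothetical file names:

```python
import torch

# Upcast every bfloat16 tensor to float32 so a float32-only CPU matmul
# path can consume the checkpoint. File names are placeholders.
state = torch.load("model_bf16.pth", map_location="cpu")
state = {
    k: v.float() if isinstance(v, torch.Tensor) and v.dtype == torch.bfloat16 else v
    for k, v in state.items()
}
torch.save(state, "model_fp32.pth")
```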
-
I am developing [llmchat.co](llmchat.co), an open-source, local-first chat interface. We do have integrations with Ollama and LM Studio, but one of the biggest hurdles that our initial users are telli…
-
A web interface designed for submitting queries and viewing real-time responses through a user-friendly UI. Built with Node.js for the frontend and a Python Socket server for backend processing, the s…
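For illustration, a minimal sketch of such a backend, assuming a plain TCP protocol (the actual project may use Socket.IO or WebSockets instead); the host, port, and canned response tokens are placeholders:

```python
import socket
import threading

HOST, PORT = "127.0.0.1", 8765  # hypothetical address for the Python backend

def handle(conn: socket.socket) -> None:
    with conn:
        query = conn.recv(4096).decode("utf-8")
        # Placeholder: call the model here and stream tokens back as they arrive.
        for token in ("This ", "is ", "a ", "streamed ", "response."):
            conn.sendall(token.encode("utf-8"))

def serve() -> None:
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as srv:
        srv.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
        srv.bind((HOST, PORT))
        srv.listen()
        while True:
            conn, _ = srv.accept()
            threading.Thread(target=handle, args=(conn,), daemon=True).start()

if __name__ == "__main__":
    serve()
```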
-
The README states some requirements for the Python, CMake, and clang versions.
Currently the install/build process does not check whether the clang version requirement is satisfied, and Ubuntu, for example, comes with a…
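A minimal sketch of such a check, assuming clang is on PATH; the minimum version below is a placeholder for whatever the README actually requires:

```python
import re
import shutil
import subprocess
import sys

MIN_CLANG = 18  # placeholder: substitute the version the README requires

def clang_major_version():
    """Return clang's major version, or None if clang is not on PATH."""
    path = shutil.which("clang")
    if path is None:
        return None
    out = subprocess.run([path, "--version"], capture_output=True, text=True).stdout
    match = re.search(r"clang version (\d+)", out)
    return int(match.group(1)) if match else None

version = clang_major_version()
if version is None or version < MIN_CLANG:
    sys.exit(f"clang >= {MIN_CLANG} is required, found: {version}")
```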
-
Hello.
First of all, thanks for sharing the BitNet training code.
I have a question about GPU memory usage.
As I understand it, BitNet can reduce VRAM usage compared to fp16/bf16 precision.
Howev…
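For context: the reduction comes from packed ternary weights at inference time; during training, the latent weights (and optimizer state) are kept in full precision, so training VRAM is not reduced. A back-of-envelope sketch of the inference-time weight memory, for a hypothetical 8B-parameter model:

```python
# Weight memory only; activations, KV cache, and optimizer state excluded.
params = 8e9
fp16_gib = params * 2 / 2**30        # fp16/bf16: 2 bytes per weight
ternary_gib = params * 0.25 / 2**30  # ternary packed at 2 bits = 0.25 bytes per weight
print(f"fp16/bf16: {fp16_gib:.1f} GiB, packed ternary: {ternary_gib:.1f} GiB")
```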
-
Seems like an absolutely awesome project. I do a lot of domain-expert LLM fine-tuning, so this would be amazing to have in my work. What has to be done to get this into common inference engines like lcp…
-
The [Training Tips, Code and FAQ](https://github.com/microsoft/unilm/blob/master/bitnet/The-Era-of-1-bit-LLMs__Training_Tips_Code_FAQ.pdf) specifies that `BitLinear` has different `forward()` definiti…
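For reference, the training-time `forward()` in that PDF is along these lines (paraphrased here; the PDF's version also applies a norm before activation quantization, omitted for brevity). The straight-through estimator keeps gradients flowing to the full-precision latent weights, while inference can instead materialize the quantized weights offline:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def activation_quant(x: torch.Tensor) -> torch.Tensor:
    # Per-token absmax quantization of activations to 8 bits.
    scale = 127.0 / x.abs().max(dim=-1, keepdim=True).values.clamp_(min=1e-5)
    return (x * scale).round().clamp_(-128, 127) / scale

def weight_quant(w: torch.Tensor) -> torch.Tensor:
    # Absmean ternary quantization of weights to {-1, 0, +1}.
    scale = 1.0 / w.abs().mean().clamp_(min=1e-5)
    return (w * scale).round().clamp_(-1, 1) / scale

class BitLinear(nn.Linear):
    """Training-time forward: the straight-through estimator passes
    gradients to the full-precision latent weights."""
    def forward(self, x: torch.Tensor) -> torch.Tensor:
        w = self.weight
        w_quant = w + (weight_quant(w) - w).detach()
        x_quant = x + (activation_quant(x) - x).detach()
        return F.linear(x_quant, w_quant)
```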