Skip to content

8bit model#3

Open
zshobbs wants to merge 2 commits intoSentdex:mainfrom
zshobbs:8bit-model
Open

8bit model#3
zshobbs wants to merge 2 commits intoSentdex:mainfrom
zshobbs:8bit-model

Conversation

@zshobbs
Copy link

@zshobbs zshobbs commented Jan 25, 2023

Run the LLM's over multiple GPUS Using 8bit models to compress the vram footprint. "facebook/opt-30b" runs on 2 nvidia rtx 3090's. "facebook/opt-66b" might squeeze onto bigger GPUs or you can use float16 to and CPU or nvme/ssd offload.

This uses Huggingface accelerate and bitsandbytes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant