When fine-tuning large language models (LLaMA, Qwen, etc.), the model is often loaded as follows to reduce GPU memory usage:

import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16
)
model = AutoModelForCausalLM.from_pretrained(
    model_dir,
    use_cache=False,
    device_map="cuda:0",
    torch_dtype=torch.bfloat16,
    quantization_config=quantization_config)

I tried a number of fixes found online, but the errors persisted. The messages looked roughly like this:

RuntimeError:
CUDA Setup failed despite GPU being available. Please run the following command to get more information:
python -m bitsandbytes
Inspect the output of the command and see if you can locate CUDA libraries. You might need to add them to your LD_LIBRARY_PATH. ...
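Before reinstalling anything, it can help to confirm whether the CUDA runtime is even visible to the dynamic loader, since that is what the `python -m bitsandbytes` diagnostic ultimately checks. A minimal, hypothetical helper (the library names are common CUDA runtime names, not an exhaustive list):

```python
from ctypes.util import find_library

def locate_cuda_runtime():
    """Return the first CUDA runtime library the loader can see, or None.

    On Linux, bitsandbytes resolves libcudart via LD_LIBRARY_PATH;
    on Windows, the DLL is named like cudart64_*.dll and found via PATH.
    """
    for name in ("cudart", "cudart64_110", "cudart64_12"):
        path = find_library(name)
        if path:
            return path
    return None

hint = locate_cuda_runtime()
print(hint or "CUDA runtime not on loader path; check LD_LIBRARY_PATH (Linux) or PATH (Windows)")
```

If this prints the fallback message, the failure is an environment problem rather than a bitsandbytes bug, and adjusting the library path may be enough.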


AttributeError: module 'bitsandbytes.nn' has no attribute 'Linear4bit'

The `load_in_4bit` and `load_in_8bit` arguments are deprecated
and will be removed in the future versions.
Please, pass a `BitsAndBytesConfig` object in `quantization_config` argument instead.
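That deprecation warning is separate from the crash: it only means the quantization flags should no longer be passed directly to `from_pretrained`, but wrapped in a `BitsAndBytesConfig`, as the loading snippet at the top already does. A minimal sketch of the migration (`model_dir` is a placeholder path):

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Deprecated style: flag passed straight to from_pretrained
# model = AutoModelForCausalLM.from_pretrained(model_dir, load_in_4bit=True)

# Current style: all quantization options live in a config object
quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    model_dir,
    quantization_config=quantization_config,
)
```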

AttributeError: 'NoneType' object has no attribute 'cquantize_blockwise_bf16_nf4'

The installed version of bitsandbytes was compiled without GPU support. 8-bit optimizers, 8-bit multiplication, and GPU quantization are unavailable.

The fix that finally worked (on Windows):

pip uninstall bitsandbytes
pip install https://github.com/jllllll/bitsandbytes-windows-webui/releases/download/wheels/bitsandbytes-0.41.0-py3-none-win_amd64.whl

Other versions are available for download here:

https://github.com/jllllll/bitsandbytes-windows-webui/releases/
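After reinstalling, a quick sanity check confirms the new wheel actually shipped the GPU kernels. This sketch only probes for the `Linear4bit` attribute from the error message above; attribute names may differ across bitsandbytes versions:

```python
def bnb_gpu_ok():
    """Return True if bitsandbytes imports cleanly with the 4-bit layers present."""
    try:
        import bitsandbytes as bnb
    except Exception:
        return False  # not installed, or CUDA setup failed at import time
    # Linear4bit is the attribute the AttributeError above complained about
    return hasattr(bnb.nn, "Linear4bit")

print("bitsandbytes GPU build OK" if bnb_gpu_ok() else "still broken")
```

Running `python -m bitsandbytes` afterwards gives a fuller diagnostic if this still reports a problem.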

Finally:

I post related design content from time to time, including but not limited to: signal processing, communication simulation, algorithm design, MATLAB App Designer GUI design, Simulink simulation... Hope this helps!
