最新消息:雨落星辰是一个专注网站SEO优化、网站SEO诊断、搜索引擎研究、网络营销推广、网站策划运营及站长类的自媒体原创博客

huggingface transformers - Removing non-English languages from Llama - Stack Overflow

programmeradmin4浏览0评论

I'm working with the meta-llama/Llama-3.2-1B model from Hugging Face Transformers and I only need it to support English. I was wondering if it's possible to remove all the other languages from this model and fine tune it for English-only use cases.

Would doing this result in better memory efficiency and speed during both training and inference? If so, could you provide guidance on how to identify and remove the non-English tokens and language data from the model and tokenizer.

Thanks in advance for your help!

发布评论

评论列表(0)

  1. 暂无评论