- Notifications
You must be signed in to change notification settings - Fork3
Mixtral-based Ja-En (En-Ja) Translation model
NotificationsYou must be signed in to change notification settings
hppRC/llm-translator
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Released model:https://huggingface.co/hpprc/Mixtral-8x7B-Instruct-ja-en
Trained with A6000 (48Gib) x 4 for 6 hours.
rye syncpip install flash-attn --no-build-isolationpip install "multiprocess==0.70.15"
bash datasets/download.sh# parse original html files to avoid noises from pre-tokenized textspython src/misc/parse_htm.py
accelerate launch --config_file accelerate.json src/train.py
- use QLoRA (r=16, targeted all linears)
- use NEFTune (α=5.0)
- use TRL
python src/demo.py# Ja > どうもこんにちは、ニンジャスレイヤーです# Hello, I'm Ninja Slayer.
importtorchfrompeftimportPeftModelfromtransformersimportAutoModelForCausalLM,AutoTokenizer@torch.inference_mode()defmain():model_id="mistralai/Mixtral-8x7B-Instruct-v0.1"peft_id="hpprc/Mixtral-8x7B-Instruct-ja-en"tokenizer=AutoTokenizer.from_pretrained(model_id)tokenizer.pad_token=tokenizer.unk_tokenmodel=AutoModelForCausalLM.from_pretrained(model_id,torch_dtype=torch.bfloat16,device_map="auto",use_cache=True, ).eval()model=PeftModel.from_pretrained(model=model,model_id=peft_id)messages= [# {"role": "user", "content": "Translate this English sentence into Japanese.\n" + input("En > ")}, {"role":"user","content":"Translate this Japanese sentence into English.\n"+input("Ja > ")}, ]prompts=tokenizer.apply_chat_template(messages,tokenize=False,add_generation_prompt=True)inputs=tokenizer(prompts,return_tensors="pt").to("cuda")outputs=model.generate(**inputs,max_new_tokens=128,do_sample=True,top_p=0.95,num_beams=5,pad_token_id=tokenizer.eos_token_id, )gen_ids=outputs[0][len(inputs.input_ids[0]):]out=tokenizer.decode(gen_ids,skip_special_tokens=True)out=out.split("\n")[0]# 生成しすぎることがあるので最初の一文だけ取り出すのがいいかもprint(out)if__name__=="__main__":main()
About
Mixtral-based Ja-En (En-Ja) Translation model
Resources
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Releases
No releases published
Packages0
No packages published
Uh oh!
There was an error while loading.Please reload this page.