【Posted】: 2023-01-04 15:44:59
【Problem description】:
I want to fine-tune my model with this code:
from huggingsound import TrainingArguments, ModelArguments, SpeechRecognitionModel, TokenSet

# Load the pretrained checkpoint to fine-tune
model = SpeechRecognitionModel("facebook/wav2vec2-large-xlsr-53")
output_dir = "my/finetuned/model/output/dir"

# Vocabulary for the target language; the "..." is a placeholder for the
# elided letters and must be spelled out before this line will run
tokens = ["a", "b", ... "y", "z", "'"]
token_set = TokenSet(tokens)

# Each item pairs an audio file with its transcription
train_data = [
    {"path": "/path/to/sagan.mp3", "transcription": "some text"},
    {"path": "/path/to/asimov.wav", "transcription": "some text"},
]
eval_data = [
    {"path": "/path/to/sagan.mp3", "transcription": "some text"},
    {"path": "/path/to/asimov.wav", "transcription": "some text"},
]

# Fine-tune and write the resulting model to output_dir
model.finetune(
    output_dir,
    train_data=train_data,
    eval_data=eval_data,
    token_set=token_set,
)
Training runs on the CPU (using system RAM). I want to train this model on the Colab GPU instead.
【Discussion】:
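One thing that may be worth checking, offered as a sketch rather than a confirmed fix: Colab only exposes a GPU after it is selected under Runtime > Change runtime type, and the model then has to be told to use it. The helper `pick_device` below is a hypothetical name introduced for illustration. The `device` keyword on `SpeechRecognitionModel` is an assumption based on the huggingsound README, so that part is left commented out and should be checked against the installed version.

```python
def pick_device() -> str:
    """Return "cuda" when PyTorch can see a GPU, otherwise "cpu"."""
    try:
        import torch  # normally preinstalled on Colab
    except ImportError:
        # Without PyTorch there is no GPU backend to use
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


device = pick_device()
print(f"Training device: {device}")

# Assumption: SpeechRecognitionModel accepts a `device` keyword
# (as shown in the huggingsound README); verify before relying on it.
# from huggingsound import SpeechRecognitionModel
# model = SpeechRecognitionModel("facebook/wav2vec2-large-xlsr-53", device=device)
```

If this prints `cpu` on Colab, the runtime most likely has no GPU attached, and changing the code alone will not move training off the CPU.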
Tags: google-colaboratory huggingface-transformers fine-tune