【发布时间】:2019-10-12 01:54:53
【问题描述】:
我正在尝试使用SciBERT预训练模型,即:scibert-scivocab-uncased 方式如下:
!pip install pytorch-pretrained-bert
import torch
from pytorch_pretrained_bert import BertTokenizer, BertModel, BertForMaskedLM
import logging
import matplotlib.pyplot as plt
tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
indexed_tokens = tokenizer.convert_tokens_to_ids(tokenized_text)
segments_ids = [1] * len(tokenized_text)
tokens_tensor = torch.tensor([indexed_tokens])
segments_tensors = torch.tensor([segments_ids])
model = BertModel.from_pretrained('/Users/.../Downloads/scibert_scivocab_uncased-3.tar.gz')
我收到以下错误:
EOFError: Compressed file ended before the end-of-stream marker was reached
我从网站(https://github.com/allenai/scibert)下载了文件
我将它从“tar”转换为gzip
没有任何效果。
关于如何解决这个问题的任何提示?
谢谢!
【问题讨论】:
标签: error-handling neural-network nlp tar word-embedding