I don't know why this problem occurs.
from mosestokenizer import MosesDetokenizer
with MosesDetokenizer('en') as detokenize:
    print(detokenize(["hi", 'my', 'name', 'is', 'artem']))
This is what I get:
stdbuf was not found; communication with perl may hang due to stdio buffering.
Traceback (most recent call last):
  File "C:\Users\ArtemLaptiev\Documents\GitHub\temp\foo.py", line 3, in <module>
    with MosesDetokenizer('en') as detokenize:
  File "C:\ProgramFiles\Anaconda\lib\site-packages\mosestokenizer\detokenizer.py", line 47, in __init__
    super().__init__(argv)
  File "C:\ProgramFiles\Anaconda\lib\site-packages\toolwrapper.py", line 52, in __init__
    self.start()
  File "C:\ProgramFiles\Anaconda\lib\site-packages\toolwrapper.py", line 92, in start
    cwd=self.cwd
  File "C:\ProgramFiles\Anaconda\lib\subprocess.py", line 709, in __init__
    restore_signals, start_new_session)
  File "C:\ProgramFiles\Anaconda\lib\subprocess.py", line 997, in _execute_child
    startupinfo)
FileNotFoundError: [WinError 2] The system cannot find the file specified
Thanks for your help!
Answer 0 (score: 1)
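Use sacremoses instead of mosestokenizer. mosestokenizer runs the original Moses Perl scripts in a subprocess, so the FileNotFoundError most likely means that perl is not installed or not on your PATH. sacremoses is a pure-Python port and does not need Perl.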
pip install -U sacremoses
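and then:
from sacremoses import MosesDetokenizer
# In sacremoses, MosesDetokenizer is a plain class, not a context manager:
# create an instance and call its detokenize() method.
detokenizer = MosesDetokenizer(lang='en')
print(detokenizer.detokenize(["hi", 'my', 'name', 'is', 'artem']))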
For full details, see the sacremoses project page.
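If you want to confirm what the original traceback was complaining about, here is a minimal sketch using only the standard library. It assumes the missing file is one of the external programs named in the warning (perl or stdbuf), which the traceback suggests but does not prove. The helper name find_tool is made up for this example.

```python
import shutil


def find_tool(name):
    """Return the full path of an executable on PATH, or None if it is missing."""
    return shutil.which(name)


# Assumption: these are the external programs the wrapper tries to launch.
for tool in ("perl", "stdbuf"):
    path = find_tool(tool)
    print(f"{tool}: {path if path else 'not found on PATH'}")
```

If perl shows up as not found, installing a Perl distribution and adding it to PATH may make the original code work, but switching to sacremoses avoids the dependency altogether.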