-
Notifications
You must be signed in to change notification settings - Fork 12
Open
Description
File "D:\project\ace2005chinese_preprocess\main.py", line 3, in
File "D:\project\ace2005chinese_preprocess\main.py", line 296, in preprocessing
parser = Parser(path=file)
File "D:\project\ace2005chinese_preprocess\main.py", line 22, in init
self.sents_with_pos = self.parse_sgm(path + '.sgm')
File "D:\project\ace2005chinese_preprocess\main.py", line 95, in parse_sgm
soup = BeautifulSoup(f.read(), features='html.parser')
UnicodeDecodeError: 'gbk' codec can't decode byte 0x88 in position 168: illegal multibyte sequence
Metadata
Metadata
Assignees
Labels
No labels