在测试场景中往往会需要生成一段随机的段落,每个段落的单词是实际的英文单词,不是随机的字母,这时就用到了python的random模块和nltk库。
如果在代码中使用如下语句下载资源库时会报错:
nltk.download('words')
nltk.download('brown')
[nltk_data] Error loading words: <urlopen error [Errno 61] Connection
[nltk_data] refused>
[nltk_data] Error loading brown: <urlopen error [Errno 61] Connection
[nltk_data] refused>
Traceback (most recent call last):
File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/python3.12/site-packages/nltk/corpus/util.py", line 84, in __load
root = nltk.data.find(f"{self.subdir}/{zip_name}")
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/python3.12/site-packages/nltk/data.py", line 579, in find
raise LookupError(resource_not_found)
LookupError:
**********************************************************************
Resource words not found.
Please use the NLTK Downloader to obtain the resource:
>>> import nltk
>>> nltk.download('words')
For more information see: https://www.nltk.org/data.html
Attempted to load corpora/words.zip/words/
Searched in:
- '/Users/testmanzhang/nltk_data'
- '/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/nltk_data'
- '/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/share/nltk_data'
- '/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/nltk_data'
- '/usr/share/nltk_data'
- '/usr/local/share/nltk_data'
- '/usr/lib/nltk_data'
- '/usr/local/lib/nltk_data'
**********************************************************************
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/unittest_aosu.py", line 17, in <module>
word_list = words.words()
^^^^^^^^^^^
File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/python3.12/site-packages/nltk/corpus/util.py", line 120, in __getattr__
self.__load()
File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/python3.12/site-packages/nltk/corpus/util.py", line 86, in __load
raise e
File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/python3.12/site-packages/nltk/corpus/util.py", line 81, in __load
root = nltk.data.find(f"{self.subdir}/{self.__name}")
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/python3.12/site-packages/nltk/data.py", line 579, in find
raise LookupError(resource_not_found)
LookupError:
**********************************************************************
Resource words not found.
Please use the NLTK Downloader to obtain the resource:
>>> import nltk
>>> nltk.download('words')
For more information see: https://www.nltk.org/data.html
Attempted to load corpora/words
Searched in:
- '/Users/testmanzhang/nltk_data'
- '/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/nltk_data'
- '/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/share/nltk_data'
- '/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/nltk_data'
- '/usr/share/nltk_data'
- '/usr/local/share/nltk_data'
- '/usr/lib/nltk_data'
- '/usr/local/lib/nltk_data'
**********************************************************************
使用提示信息中的方法也是不行:
>>> import nltk
>>> nltk.download('words')
[nltk_data] Error loading words: <urlopen error [Errno 61] Connection
[nltk_data] refused>
False
后来访问了nltk data的地址:https://www.nltk.org/nltk_data/,手动下载了资源:
下载完成后时一个zip文件:words.zip
解压后放到错误提示信息中的目录中,例如,'/Users/testmanzhang/nltk_data'
在我的本地并没有nltk_data这个目录,于是就手动创建了一个在这个目录下还要创建corpora,将解压后的words放到这个目录中:
/Users/testmanzhang/nltk_data/corpora
这时候再调试就OK了~