题意:使用 Python、AutoGPT 和 ChatGPT 从下载的 HTML 页面中提取数据。
问题背景:
Note: If you're downvoting at least share why. I put in a lot of effort to write this question, shared my code and did my own research first, so not sure what else I could add.
注意:如果你要反对投票,请至少分享原因。我花了很多精力写这个问题,分享了我的代码并进行了自己的研究,所以不确定我还能添加什么。
I already use Scrapy to crawl websites successfully. I extract specific data from a webpage using CSS selectors. However, it's time consuming to setup and error prone. I want to be able to pass the raw HTML to chatGPT and ask a question like
我已经成功使用 Scrapy 爬取网站。我通过 CSS 选择器从网页中提取特定数据。然而,设置过程耗时且容易出错。我希望能够将原始 HTML 传递给 ChatGPT,并问一个类似于以下的问题: