Website group article content collection software, Taoxiaobai Baidu knows aggregation article collection software documentation (2)

Previous article: Baidu Zhizhi aggregation article collection software documentation

Hello everyone, I am Taoxiaobai. Baidu knows the instructions for aggregating article collection software and wrote them very simply for you. Because time is tight and there are other software scripts to write, I only gave you a brief introduction. Let me give you a detailed introduction below!

1. Software language: Python

2. Collect articles known to Baidu in batches based on keywords. A large amount of data can be generated. Conservative testing can produce 100,000+ data a day, which is suitable for website groups.

3. Put the organized keywords into keywords.txt, one per line, encoded in UTF8

4. The keywords we have sorted out need to be cleaned. To remove keywords such as videos and pictures, the free cleaning software in my QQ group files can be downloaded and used. ❤TXB2196

After cleaning the keywords, double-click main.py directly

5. Screenshots of execution effects 

6. The generated article is saved under the data folder, and has been automatically aggregated and matched with pictures. The effect is as follows:

 The pictures used are links from Sogou Gallery, so you can use them with confidence. All the pictures called have been checked to see if they can be opened normally. Remote links will not affect the collection.

7. Double title = keyword + Baidu knows the title

It is recommended to use the keyword database of 5118 or other websites.

8. The software is not connected to the website for publishing. It can only save the data to local txt and publish it separately later. It can be used by Locomotive, Shuimiao, or other software/scripts/plug-ins.

9. All friends who have purchased it, please use it as soon as possible, don’t delay, and store the data as soon as possible.

10. Some customers who have used it said that it crashed because there was a problem with the file encoding.

keywords.txt config.ini Try to use notepad++ to edit. Do not use Notepad to edit. Notepad editing may cause coding problems.

solution:

① Use notepad++ to open keywords.txt config.ini and encode it into utf8

②. Delete keywords.txt config.ini and re-unzip it.

11. This software needs to be bound to the computer as before.

12. Customer feedback:

 

 

 

Guess you like

Origin blog.csdn.net/u012917925/article/details/132253389