Python 3 + Scrapy crawler tutorial: creating a project in detail

I recently decided to get some hands-on practice with the Scrapy framework by crawling Tencent's social recruitment listings and storing them. This post records the steps for creating the project.

PyCharm does not offer a way to create a Scrapy project directly.

Therefore, we need to create the new Scrapy project from the command line.

Make sure you have scrapy, twisted, and pypiwin32 installed.
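A quick way to confirm this is to ask the interpreter whether it can find each package. The following is only a minimal sketch: the helper name check_installed is hypothetical, and it assumes pypiwin32 is importable under the module name win32api (on Windows only).

```python
from importlib.util import find_spec

# Package name -> module name to look for.
# Assumption: pypiwin32 is imported as "win32api" (Windows only).
REQUIRED_MODULES = {
    "scrapy": "scrapy",
    "twisted": "twisted",
    "pypiwin32": "win32api",
}


def check_installed(modules=REQUIRED_MODULES):
    """Return {package name: True/False} for the current interpreter."""
    return {pkg: find_spec(mod) is not None for pkg, mod in modules.items()}


for package, found in check_installed().items():
    print(f"{package}: {'installed' if found else 'missing'}")
```

Anything reported as missing can then be installed with pip before continuing.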

First: go to the directory where you want to store the project.

Mine is the Scrapy directory on the E drive.

Second: create the project: scrapy startproject *** (where *** is the project name)

In my case, this creates a project called tencent.

Third: enter the project directory and create a new spider: scrapy genspider tencent_spider <domain>

Note that in the command above, the first argument (tencent_spider) is the name of the spider and the second argument is the domain name to crawl.
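To make the argument order explicit, the commands from the second and third steps can be sketched as small Python helpers. The helper names are hypothetical, and example.com is only a placeholder, not the domain used for this project.

```python
def startproject_cmd(project_name):
    """Argument list for: scrapy startproject <project_name>"""
    return ["scrapy", "startproject", project_name]


def genspider_cmd(spider_name, domain):
    """Argument list for: scrapy genspider <spider_name> <domain>"""
    # The spider name comes first, the domain second.
    return ["scrapy", "genspider", spider_name, domain]


print(" ".join(startproject_cmd("tencent")))
# "example.com" is a placeholder domain used only for illustration
print(" ".join(genspider_cmd("tencent_spider", "example.com")))
```

Each list could also be passed to subprocess.run, with cwd set to the directory the command should run in. As far as I know, Scrapy refuses to create a spider whose name is the same as the project name, so the spider here gets a name of its own.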

This gives us a new crawler project. Open the project folder to view its contents.

Then open the spiders folder.
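For reference, here is a rough sketch of what the project folder should contain. The file list is an assumption based on what recent Scrapy versions generate with startproject, and the helper names are hypothetical.

```python
from pathlib import Path


def expected_files(project_name):
    """Files that startproject is assumed to generate (recent Scrapy versions)."""
    pkg = Path(project_name)
    return [
        Path("scrapy.cfg"),
        pkg / "__init__.py",
        pkg / "items.py",
        pkg / "middlewares.py",
        pkg / "pipelines.py",
        pkg / "settings.py",
        pkg / "spiders" / "__init__.py",
    ]


def missing_files(project_root, project_name):
    """Return the expected files that do not exist under project_root."""
    root = Path(project_root)
    return [p for p in expected_files(project_name) if not (root / p).exists()]


# Run from inside the project folder; an empty list means nothing is missing
print(missing_files(".", "tencent"))
```

After the genspider step, the spiders folder should additionally contain tencent_spider.py.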

Next, we open the project with PyCharm.

Click File -> Open, find the folder where the project is located, and open it.

That completes the new Scrapy project. If scrapy is still underlined in red even though you have installed the required libraries, remember to switch the interpreter.

The interpreter setting is under File -> Settings.

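It is also recommended that you create a new file in the project root directory and write the following in it:

from scrapy import cmdline

# Same as running "scrapy crawl tencent_spider" on the command line
cmdline.execute("scrapy crawl tencent_spider".split())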

This way, we can simply run this file each time, without having to go to the command line to execute the crawl command.

The file can be created by right-clicking the tencent folder and creating a new Python file.
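The run file works because the command string is split on whitespace into an argument list, the same form the scrapy command receives from the shell. A small sketch of just that step (crawl_argv is a hypothetical helper name):

```python
def crawl_argv(spider_name):
    """Build the argument list for 'scrapy crawl <spider_name>'."""
    return f"scrapy crawl {spider_name}".split()


print(crawl_argv("tencent_spider"))  # ['scrapy', 'crawl', 'tencent_spider']
```

Changing the spider name in one place like this makes it easy to reuse the same run file for another spider.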

That concludes this walkthrough of creating a project with Python 3 and Scrapy. I hope it helps. If you have any questions, please leave me a message and I will reply promptly. Thank you very much for your support of my website!