Philosophy

Cabu was created to be part of a new kind of application that makes more sense running remotely than locally. Following the Twelve-Factor App methodology as closely as possible makes it more reliable, flexible and trustworthy in the cloud.

Because of its nature, and because Cabu was designed with Docker in mind, it makes sense to use the Docker tools and ecosystem to build, ship and run your application.

Modular

With minimal configuration, you can crawl a website using requests or aiohttp, depending on your needs and your Python version. With a webdriver set up, you can crawl with a real browser, and a little more configuration unlocks even more. In short, this project is modular: you start with almost nothing, and it is up to you to integrate modules or external services.
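As a rough illustration (plain Python rather than Cabu's own API; the URL is a placeholder), here is what fetching a page looks like with each library:

    # Synchronous fetch with requests (works on older Python versions).
    import requests

    html = requests.get("https://example.com", timeout=10).text

    # Asynchronous fetch with aiohttp (requires Python 3.5+).
    import asyncio
    import aiohttp

    async def fetch(url):
        async with aiohttp.ClientSession() as session:
            async with session.get(url) as response:
                return await response.text()

    html = asyncio.get_event_loop().run_until_complete(fetch("https://example.com"))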

Some use cases:

  • Functionally test a website
  • Periodically crawl some data
  • Act as a data mining bot
  • Crawl an entire website (using several instances)

If you only need to crawl from a local machine, there are already plenty of tools for that; Scrapy is one of the most famous and convenient. Cabu aims to let you create a crawler in the cloud in very few steps and with very little work.

Cabu is social

This kind of tool makes more sense when integrated into a distributed system. Because Cabu is a Flask extension, it is compatible with many Flask plugins. And because you communicate with your crawlers over HTTP only, it is easy to integrate with an asynchronous task manager like ØMQ or Hooky, to put an NGINX proxy in front of it, and so on. Even the database is not included: it is again up to you to make your application talk to a hosted MongoDB, or simply export to an FTP server or an S3 bucket.
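To give a feel for that loose coupling, here is a hedged sketch of glue code: the /crawl endpoint, its payload and the bucket name are assumptions made up for illustration, not Cabu's documented API. It asks a remote crawler for a page over HTTP and exports the result to S3 with boto3 instead of a bundled database:

    # Hypothetical glue code: the endpoint, payload and bucket name are
    # assumptions for illustration, not Cabu's documented API.
    import json
    import boto3
    import requests

    CABU_URL = "http://crawler.internal:5000/crawl"   # hypothetical endpoint
    BUCKET = "my-crawl-results"                       # hypothetical bucket

    def crawl_and_archive(target_url):
        # Ask the remote crawler to fetch the target page over HTTP.
        response = requests.post(CABU_URL, json={"url": target_url}, timeout=30)
        response.raise_for_status()

        # Export the raw result to S3 rather than storing it locally.
        s3 = boto3.client("s3")
        s3.put_object(
            Bucket=BUCKET,
            Key="results/latest.json",
            Body=json.dumps(response.json()).encode("utf-8"),
        )

    if __name__ == "__main__":
        crawl_and_archive("https://example.com")

The same pattern applies to any other sink: swap the boto3 call for an FTP upload or a MongoDB insert and the crawler itself stays untouched.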

... And has a best friend

Using Docker and the Docker Compose tool makes the development kick-off a matter of a few seconds. You can find the Dockerfile configuration at the root of the project or here.

I hope you will enjoy this project. Cheers!