Commit Graph

5 Commits

Author SHA1 Message Date
970f510cd1
feat: add retrieval from le monde diplomatique
Add retrieval from le monde diplomatique

Previously, 404 pages were injected into the document when downloading
styles. Now, the downloader returns None when a document is not found.
2021-05-22 04:41:01 +02:00
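
The behaviour change described in this commit could look roughly like the sketch below. It is an illustration only, not the project's actual code: `Response`, `body_if_found`, and `inline_style` are hypothetical names, and Rust is assumed from the wording of the commit messages. The idea is that a 404 maps to `None`, so the caller skips the missing stylesheet instead of embedding the error page.

```rust
/// Hypothetical stand-in for an HTTP response; not the project's real type.
#[derive(Debug, Clone, PartialEq)]
pub struct Response {
    pub status: u16,
    pub body: String,
}

/// Returns the body only when the resource was found.
/// A 404 yields `None`, so the error page is never passed on as content.
pub fn body_if_found(response: Response) -> Option<String> {
    match response.status {
        404 => None,
        _ => Some(response.body),
    }
}

/// Appends a stylesheet when one was downloaded; otherwise returns the
/// document unchanged.
pub fn inline_style(document: &str, style: Option<String>) -> String {
    match style {
        Some(css) => format!("{}<style>{}</style>", document, css),
        None => document.to_string(),
    }
}
```

Returning an `Option` rather than the raw body forces every caller to decide explicitly what to do when a resource is missing.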
865b949b5f
feat: add builder for mediapart, document chatbot
All checks were successful
continuous-integration/drone/pr Build is passing
A builder for mediapart has been added. No generic builder has been
created, as there is no use case yet.

Some documentation has been added, the roadmap and scope have been
clarified, and the chatbot has been lightly documented.
2021-04-29 02:11:32 +02:00
a16dbbc790
feat: add basic chatbot
Some checks failed
continuous-integration/drone/pr Build is running
continuous-integration/drone/push Build is failing
A basic chatbot application that downloads articles from one newspaper
has been added.

It can download HTML pages and is called with !hmtl.

ArticleLocation has been refactored to own its internal data.
2021-04-27 04:32:37 +02:00
756b1592b7
feat: allow removing elements of html pages
A feature to remove elements of HTML pages based on CSS selectors has
been added.

The removal of link elements that load external JS has been added.
2021-04-24 03:45:13 +02:00
c4ab210c4d
feat: add retrieval application and one newspaper
A first example and some documentation have been added.

The first example builds an article location and downloads the article as
an HTML String.

The documentation explains how the application has been designed, what
its goal is, and what its intended architecture is.
2021-04-23 22:12:02 +02:00