11 Commits

Author SHA1 Message Date
e34edf0b21
fix: keep media queries in ref styles
Previously, media queries weren't kept when downloading styles from ref
tags.

This has been fixed so that the media attribute is kept when creating
style tags from ref tags.
2021-05-22 04:41:08 +02:00
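
The fix described in e34edf0b21 can be pictured roughly as follows. This is
only a sketch, not the repository's code: it assumes "ref tags" means
stylesheet link elements carrying an optional media attribute, and the
function name `style_tag_from_link` is hypothetical.

```rust
/// Build an inline style tag from downloaded CSS, carrying over the
/// `media` attribute of the originating tag when there is one.
/// Hypothetical helper: the name and signature are assumptions.
/// Note: the media value is inserted as-is, without HTML escaping.
pub fn style_tag_from_link(css: &str, media: Option<&str>) -> String {
    match media {
        // Keep the media query so the styles stay conditional.
        Some(query) => format!("<style media=\"{}\">{}</style>", query, css),
        // No media attribute on the source tag: plain style tag.
        None => format!("<style>{}</style>", css),
    }
}
```

Without the `Some` branch, print-only or mobile-only styles would apply
unconditionally once inlined.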
40ebc1ddea
feat: allow injecting styles 2021-05-22 04:41:08 +02:00
6e091a32fc
chore: use a config struct for self_contained_html
Previously, self_html_function was a function taking all parameters as
arguments.
As new optional parameters are being added, the function had too many
arguments, and every call site would have to be modified each time an
argument was added.

Therefore, it has been moved to a configuration structure with a `run`
function taking only one argument, the html string.
2021-05-22 04:41:08 +02:00
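
A minimal sketch of the pattern described in 6e091a32fc, under stated
assumptions: the struct name `Config` and its fields are hypothetical, and
the body of `run` only injects styles to keep the example short. The only
detail taken from the commit message is a `run` function that takes the html
string as its single argument.

```rust
/// Hypothetical configuration struct; field names are illustrative only.
#[derive(Debug, Clone, Default)]
pub struct Config {
    /// Extra CSS snippets to inject into the page.
    pub styles_to_add: Vec<String>,
    /// CSS selectors of elements to remove (not applied in this sketch).
    pub elements_to_remove: Vec<String>,
}

impl Config {
    /// Process `html` according to the configuration.
    /// Here it only inserts the configured styles before `</head>`,
    /// or prepends them when the page has no closing head tag.
    pub fn run(&self, html: &str) -> String {
        if self.styles_to_add.is_empty() {
            return html.to_string();
        }
        let style_tags: String = self
            .styles_to_add
            .iter()
            .map(|css| format!("<style>{}</style>", css))
            .collect();
        match html.find("</head>") {
            Some(idx) => format!("{}{}{}", &html[..idx], style_tags, &html[idx..]),
            None => format!("{}{}", style_tags, html),
        }
    }
}
```

Because the struct derives `Default`, a caller can write
`Config { styles_to_add: vec![...], ..Default::default() }`, so adding a new
optional field later does not break existing call sites.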
5d0872b4d9
feat: add retrieval from courrier international
Retrieval of articles from courrier international has been added
2021-05-22 04:41:08 +02:00
cee0af6c3c
fix: only select images that have non-data src
Previously, when the image url contained inline data, it was parsed as
a url, which failed, instead of the data being kept.

This has been fixed so that images whose url starts with 'data' are
not modified.
2021-05-22 04:41:08 +02:00
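
The check described in cee0af6c3c might look like the sketch below. The
function names are hypothetical, and matching on the `data:` scheme prefix
(rather than the bare word 'data') is an assumption made here. The
`absolutize` callback stands in for whatever url resolution the library
actually performs.

```rust
/// True when `src` is an inline data URL (assumed to start with `data:`).
pub fn is_data_url(src: &str) -> bool {
    src.trim_start()
        .get(..5)
        .map_or(false, |prefix| prefix.eq_ignore_ascii_case("data:"))
}

/// Rewrite an image `src`: data URLs are returned untouched, anything
/// else is passed to `absolutize` (a hypothetical resolution step).
pub fn rewrite_src<F: Fn(&str) -> String>(src: &str, absolutize: F) -> String {
    if is_data_url(src) {
        src.to_string()
    } else {
        absolutize(src)
    }
}
```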
970f510cd1
feat: add retrieval from le monde diplomatique
Add retrieval from le monde diplomatique

Previously, 404 pages were injected into the document when downloading
styles.
Now, the downloader returns None when documents are not found.
2021-05-22 04:41:01 +02:00
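
The behaviour change in 970f510cd1 can be sketched as follows. This is not
the repository's downloader: both function names are hypothetical, and the
HTTP status is passed in as a plain number so the example needs no network
access.

```rust
/// Map a response to `None` when the document was not found, so that a
/// 404 error page is never treated as content.
pub fn body_if_found(status: u16, body: String) -> Option<String> {
    if status == 404 {
        None
    } else {
        Some(body)
    }
}

/// Turn a downloaded stylesheet into a style tag, or into nothing when
/// the download returned `None`.
pub fn style_from_download(downloaded: Option<String>) -> String {
    match downloaded {
        Some(css) => format!("<style>{}</style>", css),
        None => String::new(),
    }
}
```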
b1d025a23c
feat: move ArticleLocation to defined errors
Previously, the functions in article_location.rs were returning generic
anyhow::Result.

In order to ease error handling when using the library, they have been
moved to specific errors.
2021-04-30 19:55:52 +02:00
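
To illustrate what b1d025a23c means by specific errors, here is a sketch
using only the standard library. The enum name, its variants, the
`newspaper_for` function and the host table are all hypothetical. The actual
error type in article_location.rs may look quite different and may rely on a
helper crate.

```rust
use std::fmt;

/// Hypothetical error type; the variants are illustrative only.
#[derive(Debug, PartialEq)]
pub enum Error {
    /// The url does not start with http:// or https://.
    MalformedUrl(String),
    /// No known newspaper matches the host.
    UnknownNewspaper(String),
}

impl fmt::Display for Error {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            Error::MalformedUrl(url) => write!(f, "malformed url: {}", url),
            Error::UnknownNewspaper(host) => write!(f, "unknown newspaper: {}", host),
        }
    }
}

impl std::error::Error for Error {}

/// Pick a newspaper from the url host (illustrative host table).
pub fn newspaper_for(url: &str) -> Result<&'static str, Error> {
    let rest = url
        .strip_prefix("https://")
        .or_else(|| url.strip_prefix("http://"))
        .ok_or_else(|| Error::MalformedUrl(url.to_string()))?;
    // The host is everything up to the first slash.
    let host = rest.split('/').next().unwrap_or("");
    match host {
        "www.mediapart.fr" => Ok("mediapart"),
        "www.monde-diplomatique.fr" => Ok("monde-diplomatique"),
        other => Err(Error::UnknownNewspaper(other.to_string())),
    }
}
```

With a dedicated enum, a caller can match on `Error::UnknownNewspaper` and
react to it directly, which a generic anyhow::Result only allows through
downcasting.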
865b949b5f
feat: add builder for mediapart, document chatbot
A builder for mediapart has been added. No generic builder has been
created as there is no use case yet.

Some documentation has been added, the roadmap and scope have been
clarified, and the chatbot has been lightly documented.
2021-04-29 02:11:32 +02:00
a16dbbc790
feat: add basic chatbot
A basic chatbot application that downloads articles from one newspaper
has been added.

It can download html pages and is called with !html

ArticleLocation has been refactored to own its internal data.
2021-04-27 04:32:37 +02:00
756b1592b7
feat: allow removing elements of html pages
A feature to remove elements of html pages based on css selectors has
been added.

The removal of link elements that load external js has been added.
2021-04-24 03:45:13 +02:00
c4ab210c4d
feat: add retrieval application and one newspaper
A first example as well as some documentation have been added.

The first example builds an article location and downloads the article
as an html String.

The documentation explains how it has been designed, what the goal of
the application is, and its intended architecture.
2021-04-23 22:12:02 +02:00