url whitelist match #11
Labels
No Label
ci
priority
critical
priority
high
priority
low
status
abandonned
status
accepted
status
help_wanted
status
in_progress
status
refused
status
review_needed
type
bug
type
discussion
type
enhancement
type
question
type
refactor
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: poc/crieur#11
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
For now, only the hostname is checked. Some URLs might have the right hostname but not be articles (configuration pages, et c).
It would be nice to have a mechanism to avoid to avoid downloading malformed pages.
Maybe it could be part of
Metadata
for newspapers, as anOption
.For the implementation, it could be :
body > div.article
, et c)Trait
with a function checking the page (url, body). It is the more generic but would require helpers function (¿using other ideas in the list?) to help it's implementation.