diff options
| author | 2022-05-12 22:15:10 +0200 | |
|---|---|---|
| committer | 2022-05-12 22:15:10 +0200 | |
| commit | 4a87206f2898665e99953590536cedc6c5505f05 (patch) | |
| tree | 398f53769048460071194d398c61c7e847f22d7e /docs | |
| parent | 9d1930d9adb4f56ae12209d3d01f4a1ed1af8503 (diff) | |
OPML export/import of some proprietary FreshRSS attributes (#4342)
* OPML export/import of some proprietary FreshRSS attributes
#fix https://github.com/FreshRSS/FreshRSS/issues/4077
And one of the TODOs of https://github.com/FreshRSS/FreshRSS/pull/4220
XPath options, CSS Selector, and action filters
* Bump library patch version
* OPML namespace + documentation
* Add example
Diffstat (limited to 'docs')
| -rw-r--r-- | docs/en/developers/OPML.md | 74 |
1 files changed, 74 insertions, 0 deletions
diff --git a/docs/en/developers/OPML.md b/docs/en/developers/OPML.md new file mode 100644 index 000000000..59a59a748 --- /dev/null +++ b/docs/en/developers/OPML.md @@ -0,0 +1,74 @@ +# OPML in FreshRSS + +FreshRSS supports the [OPML](https://en.wikipedia.org/wiki/OPML) format to export and import lists of RSS/Atom feeds in a standard way, compatible with several other RSS aggregators. + +However, FreshRSS also supports several additional features not covered by the basic OPML specification. +Luckily, the [OPML specification](http://opml.org/spec2.opml) allows extensions: + +> *An OPML file may contain elements and attributes not described on this page, only if those elements are defined in a namespace.* + +and: + +> *OPML can also be extended by the addition of new values for the type attribute.* + +## FreshRSS OPML extension + +FreshRSS uses the XML namespace <https://freshrss.org/opml> to export/import extended information not covered by the basic OPML specification. + +The list of the custom FreshRSS attributes can be seen in [the source code](https://github.com/FreshRSS/FreshRSS/blob/edge/app/views/helpers/export/opml.phtml), and here is an overview: + +### HTML+XPath + +* `<outline type="HTML+XPath" ...`: Additional type of source, which is not RSS/Atom, but HTML Web Scraping using [XPath](https://www.w3.org/TR/xpath-10/) 1.0. + +> ℹ️ [XPath 1.0](https://en.wikipedia.org/wiki/XPath) is a standard query language, which FreshRSS supports to enable [Web scraping](https://en.wikipedia.org/wiki/Web_scraping). + +The following attributes are using similar naming conventions than [RSS-Bridge](https://rss-bridge.github.io/rss-bridge/Bridge_API/XPathAbstract.html). + +* `frss:xPathItem`: XPath expression for extracting the feed items from the source page. + * Example: `//div[@class="news-item"]` +* `frss:xPathItemTitle`: XPath expression for extracting the feed title from the source page. + * Example: `descendant::h2` +* `frss:xPathItemContent`: XPath expression for extracting an item’s content from the item context. + * Example: `.` +* `frss:xPathItemUri`: XPath expression for extracting an item link from the item context. + * Example: `descendant::a/@href` +* `frss:xPathItemAuthor`: XPath expression for extracting an item author from the item context. + * Example: `"Anonymous"` +* `frss:xPathItemTimestamp`: XPath expression for extracting an item timestamp from the item context. The result will be parsed by [`strtotime()`](https://php.net/strtotime). +* `frss:xPathItemThumbnail`: XPath expression for extracting an item’s thumbnail (image) URL from the item context. + * Example: `descendant::img/@src` +* `frss:xPathItemCategories`: XPath expression for extracting a list of categories (tags) from the item context. + +### Miscellaneous + +* `frss:cssFullContent`: [CSS Selector](https://developer.mozilla.org/en-US/docs/Web/CSS/CSS_Selectors) to enable the download and extraction of the matching HTML section of each articles’ Web address. + * Example: `div.main` +* `frss:filtersActionRead`: List (separated by a new line) of search queries to automatically mark a new article as read. + +### Example + +```xml +<?xml version="1.0" encoding="UTF-8"?> +<opml version="2.0"> + <head> + <title>FreshRSS OPML extension example</title> + </head> + <body> + <outline xmlns:frss="https://freshrss.org/opml" + text="Example" + type="HTML+XPath" + xmlUrl="https://www.example.net/page.html" + htmlUrl="https://www.example.net/page.html" + description="Example of Web scraping" + frss:xPathItem="//a[contains(@href, '/interesting/')]/ancestor::article" + frss:xPathItemTitle="descendant::h2" + frss:xPathItemContent="." + frss:xPathItemUri="descendant::a[string-length(@href)>0]/@href" + frss:xPathItemThumbnail="descendant::img/@src" + frss:cssFullContent="article" + frss:filtersActionRead="intitle:⚡️ OR intitle:🔥 something" + /> + </body> +</opml> +``` |
