~patrickpichler/clojure-readability

Pure clojure implementation of arc90 readability tool.

aef8461 Add basic monadic parser implementation

6 months ago

42ff687 Add further explanation and goals to README

6 months ago

#clojure-readability

Pure clojure implementation of arc90 readability tool.

The idea is to have a all done in pure clojure, without any dependencies. This includes a basic html parser.

#Goals

  • Be able to parse any valid html
  • Extract main text from the article and get rid of all the noise

#Non-Goals

  • Performance (it is fine if it takes ages to parse the html)

#Special thanks

Special thanks to Oleksii Kachaiev for providing this awesome turorial on how to write a monadic parser in clojure.