1. Nov 20th, 2005

    Microformat Parser for Ruby

    A Ruby library for creating parsers that can be used to extract microcontent from (X)HTML documents in a variety of microformats.

    This library has been out for a while, but now it has an official documentation page, and a RubyGem you can download from RubyForge.

    Oh, and gem install uformatparser is coming up, as soon as I figure out how to make it work.

    Update: Thanks to Tom Copeland of RubyForge for pointing me at the source of the problem. You can now install the Microformat Parser as a Gem with

    gem install uformatparser 

    Do I need to mention that RubyForge/RubyGem is one of the killer features of Ruby?

    Image by mollyeh11.

    1. Nov 21st, 2005

      Tom Copeland

      Thanks, Assaph! Bummer about the Gems/YAML thing, but hopefully 1.8.4 will clear that up…

    2. Jul 6th, 2006

      Labnotes » Blog Archive » The UI is the API: Scraping with Ruby @ Mashup Camp II

      [...] Once I run out of colorful metaphors, I’m going to show you how to get eBay listing data in 10 lines of Ruby code. And give you a run down of the scrAPI toolkit for Ruby, which is going 1.0 next week (2.0 if you count uformatparser). [...]

    3. Jul 11th, 2006

      Labnotes » Blog Archive » Scraping with style: scrAPI toolkit for Ruby

      [...] Last year I started using CSS selectors for the microformat parser. It worked very well for microformats, which are better structured than your average HTML page. [...]

    4. Jun 7th, 2007

      code elements » Microformats: Path to Sematic Web?

      [...] Ruby microformat [...]

    Your comment, here ⇓