GNU libextractor

wget_lj.txt (340B)


$ wget -q http://www.linuxjournal.com/
$ extract index.html
description - The Monthly Magazine of the Linux Community
keywords - linux, linux journal, magazine
author - Linux Journal  - The Premier Magazine of the Linux Community
title - Linux Journal  - The Premier Magazine of the Linux Community

Caption: Extracting meta-data from HTML.
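
The listing above prints one "type - value" line per piece of meta-data found in the page. As a rough illustration of where such values can come from in an HTML file, here is a minimal Python sketch that collects the <title> text and <meta name=... content=...> pairs using only the standard library. This is not how libextractor or the extract tool is implemented; the names MetaExtractor and extract_meta and the sample page are hypothetical and exist only for this example.

```python
from html.parser import HTMLParser


class MetaExtractor(HTMLParser):
    """Hypothetical helper: collects <title> text and named <meta> tags."""

    def __init__(self):
        super().__init__()
        self.items = []        # (type, value) pairs in document order
        self._in_title = False
        self._title_parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
            self._title_parts = []
        elif tag == "meta":
            attributes = dict(attrs)
            name = attributes.get("name")
            content = attributes.get("content")
            # Only keep <meta> tags that carry both a name and a content value.
            if name and content:
                self.items.append((name.lower(), content.strip()))

    def handle_data(self, data):
        if self._in_title:
            self._title_parts.append(data)

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self.items.append(("title", "".join(self._title_parts).strip()))


def extract_meta(html):
    """Return a list of (type, value) pairs found in the HTML string."""
    parser = MetaExtractor()
    parser.feed(html)
    parser.close()
    return parser.items


# Made-up sample page, used only to exercise the sketch.
sample = """<html><head>
<title>Example Page</title>
<meta name="keywords" content="linux, magazine">
<meta name="description" content="A sample page">
</head><body></body></html>"""

if __name__ == "__main__":
    for kind, value in extract_meta(sample):
        print(f"{kind} - {value}")
```

Running the sketch prints the pairs in the same "type - value" layout as the listing, in the order they appear in the sample page.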