Jaunt Java Web Scraping & Automation API

Introduction
July 2, 2014
0.9.7.1 Release!

Test drive Jaunt today and leave feedback in the forum to help shape the next release!
Jaunt Beta is a new, free Java library for web scraping and web automation. The API presents a lightweight, headless browser for interfacing with websites, web apps, and web services. Jaunt makes it easy to parse, traverse, search, extract, and filter HTML and XML data. It provides three levels of abstraction: DOM level, component level, and browser level. It is well suited to web automation where JavaScript is not required, including:
  • filling out and submitting forms.
  • creating web-bots or web-scraping programs.
  • creating REST clients for XML services.
  • interfacing with web-based APIs or web apps.
  • automated testing.
Features
Jaunt Beta is free (see the product comparison). Features include:
  • HTML, XHTML, and XML parsing.
  • Protocols: HTTP, HTTPS, basic auth.
  • Form completion via field labels/names.
  • Automatic form permutation.
  • File downloading/uploading.
  • Table data extraction.
  • DOM navigation, search, and search chaining.
  • Regular-expression-enabled querying.
  • HTTP header/cookie manipulation.
  • HTTP proxy support.
  • Customizable caching and content handlers.
  • 100% Java (no dependencies).
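
For readers unfamiliar with the "basic auth" item in the protocol list above, the sketch below shows what such a header looks like on the wire. It uses only the JDK and is not Jaunt's API; the class name BasicAuthSketch and the method basicAuthHeader are hypothetical names chosen for illustration. It assumes the standard scheme in which the user name and password are joined with a colon and Base64-encoded.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;

// Hypothetical helper, not part of Jaunt: builds the value of an
// HTTP "Authorization" header for the basic authentication scheme.
class BasicAuthSketch {

    // Joins user and password with ':' and Base64-encodes the UTF-8 bytes.
    static String basicAuthHeader(String user, String password) {
        String credentials = user + ":" + password;
        byte[] bytes = credentials.getBytes(StandardCharsets.UTF_8);
        return "Basic " + Base64.getEncoder().encodeToString(bytes);
    }

    public static void main(String[] args) {
        // Sample credentials only; a real client would read these from configuration.
        System.out.println("Authorization: " + basicAuthHeader("Aladdin", "open sesame"));
    }
}
```

A library with basic auth support would typically attach a header like this to each request on the caller's behalf, so the sketch is only meant to show what travels over the connection.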