Package | Description |
---|---|
com.jaunt |
Jaunt [website] is a web-scraping & automation library that provides a lightweight HTTP useragent (headless browser), including JSON parser.
|
com.jaunt.component | |
com.jaunt.util |
Modifier and Type | Class and Description |
---|---|
class |
Document
Represents an HTML, XHTML, or XML Document
|
class |
Elements
A container class for holding search results, which is itself searchable (inheriting search methods from Element).
|
Modifier and Type | Method and Description |
---|---|
Element |
Element.findFirst(String query)
Searches all child/descendant elements and retrieves the first one found to match the query.
|
Element |
Element.findNearestAncestor(String tagNameRegex)
Finds the nearest ancestor Element (working upwards from the current Element) whose tagname matches the specified case insensitive regular expression (using Matcher.matches()).
|
Element |
Node.findNearestCommonAncestor(Node node) |
Element |
Element.getElement(int n)
Retrieves the nth child Element or throws a NotFound Exception if no element exists at the specified position.
|
Element |
Element.getFirst(String query)
Searches only child elements (not all descendants) and retrieves the first child element that was found to match the query.
|
Element |
Node.getParent()
Returns the parent Element.
|
Element |
Element.getRoot()
Returns the root ancestor Element (or Document container, if it exists).
|
Element |
Element.nextSiblingElement()
Retrieves the next sibling Element in the DOM (ie, the next Element that shares the same parent); note that this method is not suitable for iterating through the children of an Elements container, since the those elements may not be siblings in the dom tree.
|
Modifier and Type | Method and Description |
---|---|
List<Element> |
Element.getChildElements()
Returns a list of child Elements, or an empty list if no child elements exist.
|
Iterator<Element> |
Elements.iterator()
returns an iterator over the child Elements.
|
List<Element> |
Elements.toList()
Returns the child Elements as a List.
|
Modifier and Type | Method and Description |
---|---|
Form |
Document.getForm(Element element)
Returns the Form for the specified form Element or throws a NotFound Exception.
|
com.jaunt.component.Table |
Document.getTable(Element tableElement)
Returns a Table component for the specified table Element or throws a NotFound exception if the specified element is not a table.
|
com.jaunt.component.Hyperlink |
Document.nextPageLink(Element container)
returns the next hyperlink in a series of numeric links within the specified container, such as links that represent pages of search results; throws exceptions if no sequences or multiple (non-equivalent) sequences were found.
|
boolean |
Document.nextPageLinkExists(Element container)
returns true if another hyperlink exists in a series of numeric links within the specified container, such as links that represent pages of search results
|
Constructor and Description |
---|
Elements(List<Element> childElements)
Creates a new Elements object who's immediate child elements are those in the specified list.
|
Modifier and Type | Field and Description |
---|---|
Element |
Form.element
The form element that begins the form.
|
Modifier and Type | Method and Description |
---|---|
Element |
Form.getElement()
Returns the form Element for this Form.
|
Constructor and Description |
---|
Form(Element formElement,
List<Element> componentElements,
Document doc,
UserAgent userAgent) |
Constructor and Description |
---|
Form(Element formElement,
List<Element> componentElements,
Document doc,
UserAgent userAgent) |
Modifier and Type | Method and Description |
---|---|
boolean |
FilterCallback.childCommentAllowed(Element parent,
Comment comment)
Deprecated.
whether the specified child comment for the specified parent element is allowed through the filter.
|
boolean |
Filter.childCommentAllowed(Element parent,
Comment comment)
Deprecated.
satisfies FilerCallback interface.
|
boolean |
FilterCallback.childElementAllowed(Element parent,
Element child)
Deprecated.
whether the specified child element for the specified parent element is allowed through the filter.
|
boolean |
Filter.childElementAllowed(Element parent,
Element child)
Deprecated.
satisfies FilerCallback interface.
|
boolean |
FilterCallback.childTextAllowed(Element parent,
Text text)
Deprecated.
whether the specified text for the specified parent element is allowed through the filter.
|
boolean |
Filter.childTextAllowed(Element parent,
Text text)
Deprecated.
satisfies FilerCallback interface.
|