...
Only two operations are used by this connector:
http://<host>:<port>/webhdfs/v1/<path>?op=OPEN
- Used to fetch the files file data to be used to extract its content.
http://<host>:<port>/webhdfs/v1/<path>?op=LISTSTATUS
- Used to scan a directory and get relevant file information like:
- Last-Modified dates for incremental crawls.
- Group and Owner