Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Only two operations are used by this connector:

http://<host>:<port>/webhdfs/v1/<path>?op=OPEN

  • Used to fetch the files file data to be used to extract its content.

http://<host>:<port>/webhdfs/v1/<path>?op=LISTSTATUS

  • Used to scan a directory and get relevant file information like:
    • Last-Modified dates for incremental crawls.
    • Group and Owner