Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.



Easy Heading Free
navigationTitleOn this Page
wrapNavigationTexttrue
navigationExpandOptionexpand-all-by-default

Introduction


The Slide Extractor is a component that detects a pptx PPTX PowerPoint file and parseparses/extract extracts the PPTX slides using Apache Tika:

Features

  • Extracting text content from PPTX slides.
  • Extracting metadata such as slide title, author, created date, and modified date.
  • Configurable max characters file size for processing large PPTX files.
  • Configurable timeout for parsing process






...