This section describes two methods for extracting tables from PDF files.
The sample code uses the Unstructured open source library and also shows an alternative method that uses the Unstructured Partition Endpoint.
To extract tables from PDF files using partition_pdf, set the skip_infer_table_types parameter to False and the strategy parameter to hi_res.
Usage
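A minimal sketch of the partition_pdf route, assuming the unstructured package is installed with its PDF extras. It passes infer_table_structure=True, the boolean keyword partition_pdf is known to accept for table inference; if your installed release exposes a different switch, adjust that keyword. The helper names select_tables and extract_pdf_tables are hypothetical and exist only for this illustration.

```python
def select_tables(elements):
    """Keep only the elements whose category is "Table"."""
    return [el for el in elements if getattr(el, "category", None) == "Table"]


def extract_pdf_tables(filename):
    """Partition a PDF with the hi_res strategy and return its Table elements."""
    # Imported lazily so the helper above can be used without the library installed.
    from unstructured.partition.pdf import partition_pdf

    elements = partition_pdf(
        filename=filename,
        strategy="hi_res",           # layout-aware strategy needed for table detection
        infer_table_structure=True,  # assumption: keyword name may vary by release
    )
    return select_tables(elements)
```

Calling extract_pdf_tables with the path to a PDF returns the Table elements. Each element's text attribute holds the plain text, and metadata.text_as_html holds an HTML rendering when table structure was inferred.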
By default, table extraction is enabled for all file types. To extract tables from PDFs and images using Auto Partition or the Unstructured API parameters, set the strategy parameter to hi_res.
Usage: Auto Partition
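A minimal sketch of the Auto Partition route, assuming the unstructured package is installed locally with the extras for the file type being processed. Only the strategy is set, since table extraction is on by default here. The helper names tables_as_html and auto_partition_tables are hypothetical.

```python
def tables_as_html(elements):
    """Return the HTML rendering of every Table element that has one."""
    html = []
    for el in elements:
        if getattr(el, "category", None) != "Table":
            continue
        rendering = getattr(el.metadata, "text_as_html", None)
        if rendering:
            html.append(rendering)
    return html


def auto_partition_tables(filename):
    """Partition any supported file with hi_res and return table HTML strings."""
    # Imported lazily so tables_as_html can be used without the library installed.
    from unstructured.partition.auto import partition

    elements = partition(filename=filename, strategy="hi_res")
    return tables_as_html(elements)
```

The same call works for images as well as PDFs, because partition detects the file type and routes it to the matching partitioner.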
Usage: API Parameters
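A hedged sketch of the API route, sending the file as a multipart form with strategy set to hi_res. It assumes the third-party requests package is installed, and that the endpoint accepts a files form field and an unstructured-api-key header. The endpoint URL and API key are passed in by the caller rather than hard-coded, because both depend on your deployment. The helper names are hypothetical.

```python
import os


def tables_from_response(payload):
    """Keep only the Table entries from the JSON list returned by the API."""
    return [item for item in payload if item.get("type") == "Table"]


def partition_via_api(filename, api_url, api_key):
    """POST a file to the partition endpoint and return its Table elements."""
    # Imported lazily so tables_from_response works without requests installed.
    import requests

    with open(filename, "rb") as fh:
        response = requests.post(
            api_url,  # assumption: your deployment's partition endpoint URL
            headers={
                "accept": "application/json",
                "unstructured-api-key": api_key,  # assumption: header name
            },
            files={"files": (os.path.basename(filename), fh)},
            data={"strategy": "hi_res"},  # the only parameter needed for tables
            timeout=300,
        )
    response.raise_for_status()
    return tables_from_response(response.json())
```

Each returned entry is a dictionary. When table structure was inferred, the HTML rendering is expected under the entry's metadata key as text_as_html.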