Depending on your needs, Unstructured provides OCR-based and Transformer-based models for detecting elements in documents. These models are useful for detecting complex document layouts and predicting element types.
Basic usage:
To use any model when partitioning, set the strategy to hi_res as shown above.
To maintain consistency between the unstructured and unstructured-api libraries, we are deprecating the model_name parameter. Please use the hi_res_model_name parameter when specifying a model.
The hi_res_model_name parameter supports the yolox and detectron2_onnx arguments.
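As a minimal sketch of selecting a model by name, the snippet below builds the keyword arguments for a hi_res call and passes them to partition. The helper names (build_partition_kwargs, partition_with_model) and the SUPPORTED_HI_RES_MODELS tuple are hypothetical and introduced only for illustration; the tuple lists just the two names mentioned above, and the installed library version may accept others.

```python
from typing import Any

# Model names taken from the text above; other versions may accept more.
SUPPORTED_HI_RES_MODELS = ("yolox", "detectron2_onnx")


def build_partition_kwargs(filename: str, model_name: str = "yolox") -> dict[str, Any]:
    """Assemble keyword arguments for a hi_res partition call (hypothetical helper)."""
    if model_name not in SUPPORTED_HI_RES_MODELS:
        raise ValueError(f"unsupported hi_res model: {model_name!r}")
    return {
        "filename": filename,
        "strategy": "hi_res",  # layout models are only used with this strategy
        "hi_res_model_name": model_name,  # replaces the deprecated model_name
    }


def partition_with_model(filename: str, model_name: str = "yolox") -> list:
    """Partition a document with the chosen layout model."""
    # Imported lazily so the helper above works without the library installed.
    from unstructured.partition.auto import partition

    return partition(**build_partition_kwargs(filename, model_name))
```

Calling partition_with_model("example.pdf", "detectron2_onnx") would then return the detected elements, assuming the file exists and the hi_res dependencies are installed.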
Unstructured will download the model specified in the UNSTRUCTURED_HI_RES_MODEL_NAME environment variable. If the variable is not defined, it will download the default model.
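The environment-variable route might look like the sketch below. The helper names (configured_model_name, partition_with_env_model) are hypothetical. The sketch also assumes that setting the variable before the library is first imported is early enough for it to take effect; exporting the variable in the shell before starting Python is the more conservative option.

```python
import os
from typing import Mapping, Optional

ENV_VAR = "UNSTRUCTURED_HI_RES_MODEL_NAME"


def configured_model_name(env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return the model name from the environment, or None if unset or empty.

    None means the library falls back to its default model (hypothetical helper).
    """
    return env.get(ENV_VAR) or None


def partition_with_env_model(filename: str, model_name: str) -> list:
    """Select the model through the environment, then partition with hi_res."""
    # Assumption: the variable is read when the model is loaded, so it is set
    # before the library is imported and before partition is called.
    os.environ[ENV_VAR] = model_name
    from unstructured.partition.auto import partition

    return partition(filename=filename, strategy="hi_res")
```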
There are three ways to use a non-default model, as follows:
partition function.