Learn how to use the Unstructured.io integration with Julep
Welcome to the Unstructured.io integration guide for Julep! This integration allows you to extract structured information from a wide variety of document formats, enabling you to build workflows that leverage advanced document processing capabilities. Whether you’re developing a document analysis system, creating a RAG pipeline, or converting unstructured documents into structured data, this guide will walk you through the setup and usage.
To use the Unstructured.io integration, you need an API key. You can obtain this key by signing up at Unstructured.io.
To get started with the Unstructured.io integration, follow these steps to configure and create a task:
Configure Your API Key
Add your API key to the tools section of your task. This will allow Julep to authenticate requests to Unstructured.io on your behalf.
Create Task Definition
Use the following YAML configuration to define your document parsing task:
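A minimal sketch of such a task definition, built from the names this guide describes: a tool called unstructured_processor of type integration, with provider unstructured, method parse, and the UNSTRUCTURED_API_KEY placeholder. The task name, the setup key unstructured_api_key, the file and filename arguments, and the `$ _.…` input expressions are assumptions for illustration; verify them against the Julep and Unstructured.io documentation before relying on them.

```yaml
# Illustrative task name (not taken from this guide)
name: Unstructured Document Processing

tools:
  - name: unstructured_processor      # tool name referenced by the main step
    type: integration                 # tool type described in this guide
    integration:
      provider: unstructured          # provider for Unstructured.io
      method: parse                   # method for Unstructured.io
      setup:
        # Assumed key name; replace the placeholder with your actual API key
        unstructured_api_key: "UNSTRUCTURED_API_KEY"

main:
  - tool: unstructured_processor
    arguments:
      # Assumed argument names and input fields
      file: $ _.base64_file           # base64 encoded file content from the execution input
      filename: $ _.filename          # original file name, e.g. report.pdf
```

With this shape, an execution would supply the base64 encoded file and its file name as input, and the single main step would pass them to the parse method.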
Run Task
Deploy your task by creating a new execution.
Basic Configuration
The tools section lists the integrations the task uses. Here, unstructured_processor is defined as an integration tool.
Tool Configuration
The tool type is integration in this context.
The provider is unstructured for Unstructured.io.
The method is parse for Unstructured.io.
Execution Configuration
The main step refers to the tool defined earlier (unstructured_processor).
Replace UNSTRUCTURED_API_KEY with your actual API key or use environment variables. For base64 encoded files, ensure your file is properly encoded before passing it to the integration.
With the Unstructured.io integration, you can efficiently convert unstructured documents into structured data for analysis, search, and AI applications. This integration provides a powerful solution for document processing, enhancing your workflow’s capabilities and enabling advanced RAG (Retrieval-Augmented Generation) pipelines.
For more information, please refer to the Unstructured.io documentation.