New course with Unstructured: Preprocessing Unstructured Data for LLM Applications

Published: 10 April 2024
on channel: DeepLearningAI

Enroll now: https://bit.ly/3TOq2Hz

Introducing Preprocessing Unstructured Data for LLM Applications, a short course made in collaboration with Unstructured that helps you improve your RAG system so it can retrieve information from diverse data formats.

In this course, you'll learn techniques for representing all sorts of unstructured data, like text, images, and tables, from many different sources, and you'll implement them to extend your LLM RAG pipeline to include Excel, Word, PowerPoint, PDF, and EPUB files.

Through hands-on lessons, explore:

How to preprocess data for your LLM application development, focusing on how to work with different document types.
How to extract and normalize various documents into a common JSON format and enrich it with metadata to improve search results.
Techniques for document image analysis, including layout detection and vision transformers, to extract and understand the content of PDFs, images, and tables.
How to build a RAG bot capable of ingesting different documents such as PDFs, PowerPoints, and Markdown files.
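
To make the idea of a common JSON format with metadata more concrete, here is a minimal sketch using only the Python standard library. It is not the Unstructured API and not the course's code: the functions `normalize` and `search`, the element type labels, and the metadata fields (`filename`, `filetype`, `index`) are all hypothetical names chosen for illustration. It only handles plain text and Markdown, and it uses simple keyword matching where a real RAG pipeline would typically use embeddings.

```python
import json
from pathlib import PurePath


def normalize(filename: str, text: str) -> list[dict]:
    """Split raw text into elements that all share one JSON-friendly schema.

    Hypothetical helper: blocks are separated by blank lines, and Markdown
    headings are labeled as titles. Real document parsers do far more.
    """
    filetype = PurePath(filename).suffix.lstrip(".").lower() or "txt"
    elements = []
    for block in text.split("\n\n"):
        block = block.strip()
        if not block:
            continue
        if filetype == "md" and block.startswith("#"):
            category, content = "Title", block.lstrip("#").strip()
        else:
            # Collapse internal line breaks so the text is one clean string.
            category, content = "NarrativeText", " ".join(block.split())
        elements.append({
            "type": category,
            "text": content,
            # Metadata travels with every element so searches can filter on it.
            "metadata": {
                "filename": filename,
                "filetype": filetype,
                "index": len(elements),
            },
        })
    return elements


def search(elements: list[dict], query: str, **metadata_filter) -> list[dict]:
    """Keyword match restricted to elements whose metadata matches the filter."""
    q = query.lower()
    return [
        e for e in elements
        if q in e["text"].lower()
        and all(e["metadata"].get(k) == v for k, v in metadata_filter.items())
    ]


# Two different source formats end up in the same element schema.
corpus = (
    normalize("notes.md", "# Chunking\n\nSplit documents by title.")
    + normalize("faq.txt", "Chunking keeps related\ntext together.")
)

# Restrict the search to plain-text sources using the attached metadata.
hits = search(corpus, "chunking", filetype="txt")
print(json.dumps(hits, indent=2))
```

Because every element has the same shape regardless of its source file, downstream steps such as chunking, embedding, and filtering can be written once; the metadata filter shows how enrichment can narrow results to a specific source or file type.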

Start processing and using diverse data types and formats to build high-performing LLM RAG systems.

Learn more: https://bit.ly/3TOq2Hz

