office-hours

Hands on With Docling (2025 Mar 13)

Event Details

Event sign up
🗓️: March 13, 2025 Thursday
⏰: 9 am PST / 11 am CST / 12 pm EST / 5pm GMT
Duration: 1 hour

Event recording will be available soon

Check resources - code, presentation slides ..etc

Q & A section


Agenda

Workshop: Hands-on with Docling

Overview

When building machine learning and data applications, a significant portion of your time will be dedicated to data wrangling - from content extraction and cleaning up data. This session introduces Dockling - a robust, open source tool, designed to handle many types of document formats including PDF, DOCX, HTML and PPTX. Attendees will learn first hand how to use Docling to extract and cleanup data from various documents

Description

Docling is a versatile document processor that handles various file types, including PDF, HTML, and DOCX. It can handle complex document structures like tables, multi-column format etc. It can even extract text from scanned documents. Docling is open source and easy to use.

More about docking: https://github.com/DS4SD/docling

Join us for this hands-on session to explore how to use Docling for your data needs.

In this workshop we will do the following:

What do you need to participate in this workshop?

Session Type:
Hands on workshop

Audience:
LLM app developers, data scientists, data engineers

Technical Level:
Intermediate

Prerequisites:
None

Duration
45 mins

Resources

will be available soon.

Speaker: Sujee Maniyam

AI Engineer, Developer Advocate @ Node51 (Consulting for IBM / The AI Alliance)

Sujee Maniyam is an expert in Generative AI, Machine Learning, Deep Learning, Big Data, Distributed Systems, and Cloud technologies. He is passionate about developer education, fostering community engagement. Sujee has led numerous training sessions, hackathons, and workshops. He is also an author, open source contributor and frequent speaker at conferences and meetups.

sujee@node51.com   •   Linkedin   •   portfolio


Q & A

Please review the session recording