PyData NYC 2024

Unstructured Data Processing with a Raspberry Pi AI Kit and Python
11-08, 15:20–16:00 (US/Eastern), Winter Garden

In today's talk I will walk through building applications in Python on a Raspberry Pi 5 enhanced with an AI Kit. We will grab images from a connected webcam, run models like Object Detection and Pose Estimation and stream data to a Milvus vector database for semantic search and RAG.


Working with unstructured data can be difficult due to its complexity, size, variability and volume. Fortunately AI and machine learning have made it possible to overcome these difficulties. The Raspberry Pi 5 + AI Kit, coupled with Python's robust libraries, offers a versatile and affordable platform for this purpose.

In this session, we'll talk about the following topics:

  • Introduction to Unstructured Data
  • Overview of the Raspberry Pi 5 + AI Kit
  • Processing Images and utilizied pre-trained models from Hailo
  • Integrating AI Models with Ollama
  • Fully documented example code and demos
  • Challenges, Limitations and Alternatives
  • Utilizing, Querying, Visualizing data with Milvus, Slack and other tools

Prior Knowledge Expected

No previous knowledge expected

https://github.com/tspannhw/SpeakerProfile

Tim Spann is a Principal Developer Advocate for Zilliz and Milvus. He works with Milvus, Towhee, Attu, GPTCache, Generative AI, HuggingFace, Python, Java, Apache NiFi, Apache Kafka, Apache Pulsar, Apache Flink, Flink SQL, Apache Pinot, Trino, Apache Iceberg, DeltaLake, Apache Spark, Big Data, IoT, Cloud, AI/DL, machine learning, and deep learning. Tim has over ten years of experience with the IoT, big data, distributed computing, messaging, streaming technologies, and Java programming. Previously, he was a Principal Developer Advocate at Cloudera, Developer Advocate at StreamNative, Principal DataFlow Field Engineer at Cloudera, a Senior Solutions Engineer at Hortonworks, a Senior Solutions Architect at AirisData, a Senior Field Engineer at Pivotal and a Team Leader at HPE. He blogs for DZone, where he is the Big Data Zone leader, and runs a popular meetup in Princeton & NYC on Big Data, Cloud, IoT, deep learning, streaming, NiFi, the blockchain, and Spark. Tim is a frequent speaker at conferences such as ApacheCon, DeveloperWeek, Pulsar Summit and many more. He holds a BS and MS in computer science.