In this repository, we will explore how to ingest and embed PDF files at scale using Spark for Retrieval Augmented Generation. We will walk through the steps required ...