The Workshop

The integration of foundation models into Autonomous Driving (AD) systems has the potential to revolutionize the field. Built on architectures with significant capacity, foundation models are capable of utilizing vast data collections through self-learning approaches. This enables them to achieve remarkable performance across diverse sets of tasks in different domains. Modern models such as DALL-E, CLIP, and SAM stand to be cornerstones for a wide range of solutions in the AD domain and have initiated new research directions.

The application of AD systems provides the opportunity to collect large amounts of data as they are equipped with high quality, multi-modal sensor suits. While the manual annotation of these data proves to be costly and time-intensive, foundation models not only allow tapping into this data source but also rapidly adapting to new tasks. Among other challenges, integrating different sensor modalities, considering the spatial and temporal nature of the data, and determining how to use it for planning and prediction remain open research questions. Additionally, issues like limited availability of computational resources and the importance of safety considerations are inherent to AD platforms and need to be addressed.

Scope and Topics

The topics of interest of the workshop include, but are not limited to:

  • Trends in model architectures: Examine the latest advancements in large vision models, multi-modal foundation models, and their customization for the AD domain.
  • Adaptation to AD sensor modalities: Typical perception data is spatially and temporally distributed and needs to be integrated with other inputs, which may include map-based information, language, and more.
  • Usage of large data sources: Explore the vast data streams generated by AD and their incorporation into the training and adaption of foundation models.
  • Self-learning methods for AD: Investigate and propose different self-learning methods, such as contrastive learning and reconstruction-based learning, for their use in the field of AD.
  • Identification of AD tasks: Take a closer look at the best practices for adding known and new downstream tasks to the foundation model and how they are trained.
  • Interpretability and trust: Investigate techniques for understanding and explaining foundational model decisions and incorporating safety guarantees.