Skip to content

A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks

License

Notifications You must be signed in to change notification settings

Mohamedelrefaie/DrivAerNet

Repository files navigation

DrivAerNet++

Update (19.11.2024): DrivAerNet++ has been accepted to NeurIPS 2024! The full dataset is now released on Harvard Dataverse. Please note the (CC BY-NC 4.0) license terms, as outlined in the License section.

Update (11.09.2024): Due to the overwhelming interest and numerous inquiries from industry partners, we are excited to announce that we are now offering commercial licensing options for the DrivAerNet and DrivAerNet++ datasets. Please refer to the DrivAerNet/DrivAerNet++ Commercial Inquiry section.

Update (15.08.2024): DrivAerNet has been utilized in the IJCAI 2024 competition. All competitors' code and results are available in the open-source repository: Rapid Aerodynamic Drag Prediction for Arbitrary Vehicles in 3D Space.

Update (11.07.2024): DrivAerNet is now integrated into NVIDIA Modulus FIGConvUNet and AeroGraphNet

Our new preprint: DrivAerNet++ paper here

Video summary of DrivAerNet++ paper here

DrivAerNet paper: here

We present DrivAerNet++, the largest and most comprehensive multimodal dataset for aerodynamic car design. DrivAerNet++ comprises 8,000 diverse car designs modeled with high-fidelity computational fluid dynamics (CFD) simulations. The dataset includes diverse car configurations such as fastback, notchback, and estateback, with different underbody and wheel designs.

Design Parameters

Design parameters for the generation of the DrivAerNet++ dataset. Several geometric parameters with significant impact on aerodynamics were selected and varied within a specific range. These parameter ranges were chosen to avoid values that are either difficult to manufacture or not aesthetically pleasing.

Shape Variation

DrivAerNet++ covers all conventional car designs. The dataset encompasses various underbody and wheel designs to represent both internal combustion engine (ICE) and electric vehicles (EV). This extensive coverage allows for comprehensive studies on the impact of geometric variations on aerodynamic performance. By including a diverse set of car shapes, DrivAerNet++ facilitates the exploration of aerodynamic effects across different vehicle types, supporting both academic research and industrial applications.

Each 3D car geometry is parametrized with 26 parameters that completely describe the design. To create diverse car designs, we used two main morphing methods: morphing boxes and direct morphing. For a detailed description of the design parameters, their ranges, lower and upper bounds, please refer to the paper.

DrivAerNet_params-ezgif com-crop

Importance of Dataset Diversity

Dataset diversity and shape variation are crucial for developing robust deep learning models in aerodynamic car design. By providing a wide range of car shapes and configurations with high-fidelity CFD, DrivAerNet++ enables models to generalize better, supports exploration of unconventional designs, and enhances understanding of how geometric features impact aerodynamic performance.

DrivAerNet_Demo_cropped

Dataset Contents & Modalities

  • Parametric Models: Parametric models with tabular data, allowing extensive exploration of automotive design variations.
  • Point Cloud Data: Point cloud data for each car design.
  • 3D Car Meshes: Detailed 3D meshes of each car design, suitable for various machine learning applications.
  • CFD Simulation Data: High-fidelity CFD simulation data for each car design, including 3D volumetric fields, surface fields, and streamlines.
  • Aerodynamic Coefficients: Key aerodynamic metrics such as drag coefficient (Cd), lift coefficient (Cl), and more.

DatasetContents

Dataset Annotations

In addition to the CFD simulation data, our dataset includes detailed annotations for various car components (29 labels), such as wheels, side mirrors, and doors. These annotations are instrumental for a range of machine learning tasks, including classification, semantic segmentation, and object detection. The comprehensive labeling can also facilitate automated CFD meshing processes by providing precise information about different car components. By incorporating these labels, our dataset enhances the utility for developing and testing advanced algorithms in automotive design and analysis.

DrivAerNet_ClassLabels_new

Computational Cost

Running the high-fidelity CFD simulations for DrivAerNet++ required substantial computational resources. The simulations were conducted on the MIT Supercloud, leveraging parallelization across 60 nodes, totaling 2880 CPU cores, with each CFD case using 256 cores and 1000 GBs of memory. The full dataset requires 39 TB of storage space. The simulations took approximately 3 × 10⁶ CPU-hours to complete.

Applications

DrivAerNet++ supports a wide array of machine learning applications, including but not limited to:

  • 🚀 Data-driven design optimization: Optimize car designs based on aerodynamic performance.
  • 🧠 Generative AI: Train generative models to create new car designs based on performance or aesthetics.
  • 🎯 Surrogate models: Predict aerodynamic performance without full CFD simulations.
  • 🔥 CFD simulation acceleration: Speed up simulations using machine learning and multi-GPU techniques.
  • 📉 Reduced Order Modeling: Create data-driven reduced-order models for efficient & fast aerodynamic simulations.
  • 💾 Large-Scale Data Handling: Efficiently store and manage large datasets from high-fidelity simulations.
  • 🗜️ Data Compression: Implement high-performance lossless compression techniques.
  • 🌐 Part and shape classification: Classify car categories or components to enhance design analysis.
  • 🔧 Automated CFD meshing: Automate the meshing process based on car components to streamline simulations.

Results Replication

DrivAerNet++ serves as a valuable benchmark dataset due to its size and diversity. It provides extensive coverage of various car designs and configurations, making it ideal for testing and validating machine learning models in aerodynamic design. We provide the train, test, and validation splits in the following folder: train_val_test_splits.

To replicate the results based on the parametric tabular data, please refer to the ParametricModels directory and the code in AutoML_parametric.py.

To replicate the results based on geometric deep learning methods, please refer to the code saved in the DeepSurrogates directory.

image

Drag values for the 8k car designs can be found Here . Update 02.01.2025: The drag values are now normalized by each car's projected frontal area and are consistent with the STL and VTK files available on Harvard Dataverse.

Datasets Comparison

DrivAerNet++ stands out as the largest and most comprehensive dataset in the field of car design.

image

Contributing

We welcome contributions to improve the dataset or project. Please submit pull requests for review.

Maintenance and Support

Maintained by the DeCoDE Lab at MIT. Report issues via GitHub issues.

Additional Resources

Previous Version

To replicate the code and experiments from the first version of DrivAerNet, please refer to the folder: DrivAerNet_v1.

License

Strict Licensing Notice: DrivAerNet/DrivAerNet++ is released under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0) and is exclusively for non-commercial research and educational purposes. Any commercial use—including, but not limited to, training machine learning models, developing generative AI tools, creating software products, running new simulations using the provided geometries or any derived geometries, or other commercial R&D applications—is strictly prohibited. Unauthorized commercial use of DrivAerNet/DrivAerNet++, or any derived data, will result in enforcement by the MIT Technology Licensing Office (MIT TLO) and may carry legal consequences. The code is distributed under the MIT License.

DrivAerNet/DrivAerNet++ Commercial Inquiry

If you are interested in the commercial use of the DrivAerNet or DrivAerNet++ datasets, please contact Mohamed Elrefaie ([email protected]) and Faez Ahmed ([email protected]) with the subject line: "DrivAerNet Commercial Inquiry".

Citations

To cite this work, please use the following reference:

@article{elrefaie2024drivaernet++,
  title={DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks},
  author={Elrefaie, Mohamed and Morar, Florin and Dai, Angela and Ahmed, Faez},
  journal={arXiv preprint arXiv:2406.09624},
  year={2024}
}

To cite the first version of DrivAerNet, please use the following reference:

@proceedings{elrefaie2024drivaernet,
    author = {Elrefaie, Mohamed and Ahmed, Faez and Dai, Angela},
    title = {DrivAerNet: A Parametric Car Dataset for Data-Driven Aerodynamic Design and Graph-Based Drag Prediction},
    volume = {Volume 3A: 50th Design Automation Conference (DAC)},
    series = {International Design Engineering Technical Conferences and Computers and Information in Engineering Conference},
    pages = {V03AT03A019},
    year = {2024},
    month = {08},
    doi = {10.1115/DETC2024-143593},
    url = {https://doi.org/10.1115/DETC2024-143593},
}