3D modeling / imaging

WindsorML: High-fidelity computational fluid dynamics dataset for automotive aerodynamics

Neil Ashton, Jordan B. Angel, Aditya S. Ghate, Gaetan K. W. Kenway, Man Long Wong, Cetin Kiris, Astrid Walle, Danielle Maddix Robinson, Gary Page

NeurIPS 2024

2024

This paper presents a new open-source high-fidelity dataset for Machine Learning (ML) containing 355 geometric variants of the Windsor body, to help the development and testing of ML surrogate models for external automotive aerodynamics. Each Computational Fluid Dynamics (CFD) simulation was run with a GPU-native high-fidelity Wall-Modeled Large-Eddy Simulation (WMLES) using a Cartesian immersed-boundary

Machine learning

SD2: Synthetic Doppler spectrum denoiser using SSM

Koushik Manjunatha, Morris Hsu, Rohit Kumar

MLTEC 2024

2024

The increasing popularity of wireless sensing applications has led to a growing demand for large datasets of realistic wireless data. However, collecting such wireless data is often time-consuming and expensive. To address this challenge, we propose a synthetic data generation pipeline using human mesh generated from videos that can generate data at scale. The pipeline first generates a 3D mesh of the human

Computer vision

The Fuse platform: Integrating data from IoT and other sensors into an industrial spatial digital twin

Gregory Biegel, Nicholas Bower, Will Castelnau

ISPRS Technical Commission IV Symposium 2024

2024

Digital Twins as virtual representations of industrial assets are being used to assimilate varied sources of data for improved awareness and decision making in operations and process optimisation. This paper explores the integration of IoT sensors into a spatial digital twin called Fuse that Woodside Energy has been building for the assets it operates. We describe the Fuse platform and its knowledge graph

Cloud and systems

DiffSign: AI-assisted generation of customizable sign language videos with enhanced realism

Sudha Krishnamurthy, Vimal Bhat, Abhinav Jain

ECCV 2024 Workshop on Assistive Computer Vision and Robotics

2024

The proliferation of several streaming services in recent years has now made it possible for a diverse audience across the world to view the same media content, such as movies or TV shows. While translation and dubbing services are being added to make content accessible to the local audience, the support for making content accessible to people with different abilities, such as the Deaf and Hard of Hearing

Computer vision

DPA-Net: Structured 3D abstraction from sparse views via differentiable primitive assembly

Fenggen Yu, Yiming Qian, Xu Zhang, Francisca Gil Ureta, Brian Jackson, Eric Bennett, Richard Zhang

ECCV 2024

2024

We present a differentiable rendering framework to learn structured 3D abstractions in the form of primitive assemblies from sparse RGB images capturing a 3D object. By leveraging differentiable volume rendering, our method does not require 3D supervision. Architecturally, our network follows the general pipeline of an image-conditioned neural radiance field (NeRF) exemplified by pixelNeRF for color prediction

Computer vision

Annorama: Enabling immersive at-desk annotation experiences in virtual reality with 3D point cloud dioramas

Subramanian Chidambaram, Alex C. Williams, Min Bai, Satyugjit Virk, Patrick Haffner, Matthew Lease, Erran Li

ACM SUI 2024

2024

Point cloud annotation plays a pivotal role in computer vision and machine learning by facilitating the creation of volumetric annotations in 3D space. While prior research has explored point cloud annotation in VR environments, its practical implementation in space-constrained office settings, where data annotation is typically conducted, remains an open question. In this paper, we introduce Annorama,

Information and knowledge management

GenRC: Generative 3D room completion from sparse image collections

Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen, Chi Liu, Yu-Lun Liu, Albert Chen, Cheng-Hao Kuo, Min Sun

ECCV 2024

2024

Sparse RGBD scene completion is a challenging task, especially when considering consistent textures and geometries throughout the entire scene. Different from existing solutions that rely on human-designed text prompts or predefined camera trajectories, we propose GenRC, an automated training-free pipeline to complete a room-scale 3D mesh with high-fidelity textures. To achieve this, we first project the

Computer vision

Predicting transient response using data-driven models for ball-impact simulations

Ross Pivovar, Fei Chen, Raghunath Katragadda, Vidyasagar Ananthan

Journal of Physics Communications

2024

This study investigates the application of machine learning (ML) models for predicting transient responses in ball-impact elastodynamics simulations. We focus on the canonical problem of ball impact on laminated structures, which captures essential physics while maintaining computational tractability. Novel contributions include: (1) development of a temporal multi-resolution strategy for stable long-time

Machine learning

ViewFusion: Towards multi-view consistency via interpolated denoising

Xianghui Yang, Yan Zuo, Sameera Ramasinghe, Loris Bazzani, Gil Avraham, Anton van den Hengel

CVPR 2024

2024

Novel-view synthesis through diffusion models has demonstrated remarkable potential for generating diverse and high-quality images. Yet, the independent process of image generation in these prevailing methods leads to challenges in maintaining multiple view consistency. To address this, we introduce ViewFusion, a novel, training-free algorithm that can be seamlessly integrated into existing pre-trained

Computer vision

Improving the convergence of dynamic NeRFs via optimal transport

Sameera Ramasinghe, Violetta Shevchenko, Gil Avraham, Hisham Husain, Anton van den Hengel

ICLR 2024

2024

Synthesizing novel views for dynamic scenes from a collection of RGB inputs poses significant challenges due to the inherent under-constrained nature of the problem. To mitigate this ill-posedness, practitioners in the field of neural radiance fields (NeRF) often resort to the adoption of intricate geometric regularization techniques, including scene flow, depth estimation, or learned perceptual similarity

Computer vision
