Attention-Enhanced Multimodal Learning for Conceptual Design Evaluations-Reference-Cited by-同舟云学术

Attention-Enhanced Multimodal Learning for Conceptual Design Evaluations

Published:2023-02-03 Issue:4 Volume:145 Page:
ISSN:1050-0472
Container-title:Journal of Mechanical Design
language:en
Short-container-title:

Author:

Song Binyang¹,Miller Scarlett²,Ahmed Faez¹

Affiliation:

1. Massachusetts Institute of Technology Department of Mechanical Engineering, , Cambridge, MA 02139

2. School of Engineering Design and Innovation, The Pennsylvania State University , State College, PA 16802

Abstract

Abstract Conceptual design evaluation is an indispensable component of innovation in the early stage of engineering design. Properly assessing the effectiveness of conceptual design requires a rigorous evaluation of the outputs. Traditional methods to evaluate conceptual designs are slow, expensive, and difficult to scale because they rely on human expert input. An alternative approach is to use computational methods to evaluate design concepts. However, most existing methods have limited utility because they are constrained to unimodal design representations (e.g., texts or sketches). To overcome these limitations, we propose an attention-enhanced multimodal learning (AEMML)-based machine learning (ML) model to predict five design metrics: drawing quality, uniqueness, elegance, usefulness, and creativity. The proposed model utilizes knowledge from large external datasets through transfer learning (TL), simultaneously processes text and sketch data from early-phase concepts, and effectively fuses the multimodal information through a mutual cross-attention mechanism. To study the efficacy of multimodal learning (MML) and attention-based information fusion, we compare (1) a baseline MML model and the unimodal models and (2) the attention-enhanced models with baseline models in terms of their explanatory power for the variability of the design metrics. The results show that MML improves the model explanatory power by 0.05–0.12 and the mutual cross-attention mechanism further increases the explanatory power of the approach by 0.05–0.09, leading to the highest explanatory power of 0.44 for drawing quality, 0.60 for uniqueness, 0.45 for elegance, 0.43 for usefulness, and 0.32 for creativity. Our findings highlight the benefit of using multimodal representations for design metric assessment.

Publisher

ASME International

Subject

Computer Graphics and Computer-Aided Design,Computer Science Applications,Mechanical Engineering,Mechanics of Materials

Link

https://asmedigitalcollection.asme.org/mechanicaldesign/article-pdf/145/4/041410/6978832/md_145_4_041410.pdf

Reference79 articles.

1. Antecedents and Consequences of Reflexivity in New Product Idea Screening*;Hammedi;J. Product Innov. Manage.,2011

2. How Should We Measure Creativity in Engineering Design? A Comparison Between Social Science and Engineering Approaches;Miller;ASME J. Mech. Des.,2021

3. When Are Designers Willing to Take Risks? How Concept Creativity and Prototype Fidelity Influence Perceived Risk;Starkey;ASME J. Mech. Des.,2019

4. Universal Sentence Encoder;Cer,2018

5. Ideas Generated in Conceptual Design and Their Effects on Creativity;Sarkar;Res. Eng. Des.,2014

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Replicability and reproducibility of data-intensive design research using workflows - example in facial expression synchrony as a measure of empathy;Journal of Engineering Design;2024-09

2. Opportunities for large language models and discourse in engineering design;Energy and AI;2024-09

3. Data-Driven Car Drag Prediction With Depth and Normal Renderings;Journal of Mechanical Design;2024-03-28

4. Integration of data science with product design towards data-driven design;CIRP Annals;2024

5. Large Language Model-Based Online Review Classification for Sub-Feature-Level Customer Opinion Analysis;2024