\begin{abstract}
% v1
\iffalse
The EDA community has been actively investigating the potential of machine learning (ML) for very large-scale integrated computer-aided design (VLSI CAD).
Numerous studies have explored learning-based techniques for cross-stage prediction tasks in the design flow, which has the potential to lead to faster design convergence.
However, despite the fact that building ML models often requires a large amount of data, most studies are limited to generating small internal datasets for validation purposes due to the scarcity of large public datasets.
This essay introduces \textcolor{red}{Benchmark Name}, the most comprehensive open-source dataset for ML tasks in VLSI CAD.
\fi
% v2
The application of machine learning (ML) to electronic design automation (EDA) for very large-scale integration (VLSI) design has attracted significant research attention.
Although effective ML models require large datasets, most studies rely on small, internally generated datasets because comprehensive public resources are lacking.
In response, we introduce EDALearn, the first holistic, open-source benchmark suite designed specifically for ML tasks in EDA.
The suite provides an end-to-end flow from synthesis to physical implementation and collects data across multiple stages of that flow.
It fosters reproducibility and promotes research into ML transferability across different technology nodes.
Covering a wide range of VLSI design instances and sizes, the benchmark reflects the complexity of contemporary VLSI designs.
Additionally, we provide an in-depth data analysis that helps users understand the attributes and distribution of our data, which is essential for creating efficient ML models.
Our contributions aim to encourage further advances in the ML-EDA domain.
\end{abstract}