abstract:4b99897649f1549c.tex

1: \begin{abstract}

2: Despite the popularity and widespread use of semi-structured data formats such as JSON, end-to-end supervised learning applied directly to such data remains underexplored.

3: We present \origami (\textbf{O}bject \textbf{R}epresentat\textbf{I}on via \textbf{G}enerative \textbf{A}utoregressive \textbf{M}odell\textbf{I}ng), a transformer-based architecture that directly processes nested key/value pairs while preserving their hierarchical semantics. Our key technical contributions include: (1) a structure-preserving tokenizer, (2) a novel key/value position encoding scheme, and (3) a grammar-constrained training and inference framework that ensures valid outputs and accelerates training convergence. These enhancements enable efficient end-to-end modeling of semi-structured data. By reformulating classification as next-token prediction, \origami naturally handles both single-label and multi-label tasks without architectural modifications. Empirical evaluation across diverse domains demonstrates \origami's effectiveness: On standard tabular benchmarks converted to JSON, \origami remains competitive with classical and state-of-the-art approaches. On native JSON datasets, we outperform baselines on multi-label classification and specialized models such as convolutional and graph neural networks on a code classification task. Through extensive ablation studies, we validate the impact of each architectural component and establish \origami as a robust framework for end-to-end learning on semi-structured data.

4: \end{abstract}

5: