c6b978667f5247e2.tex
1: \begin{abstract}
2: Concept learning deals with learning description logic concepts from a background knowledge and input examples. 
3: The goal is to learn a concept that covers all positive examples, while not covering any negative examples.
4: This non-trivial task is often formulated as a search problem within an infinite quasi-ordered concept space.
5: Although state-of-the-art models have been successfully applied to tackle this problem, their large-scale applications have been severely hindered due to their excessive exploration incurring impractical runtimes.
6: Here, we propose a remedy for this limitation.
7: We reformulate the learning problem as a multi-label classification problem and propose a neural embedding model (NERO) that learns permutation-invariant embeddings 
8: for sets of examples tailored towards
9: predicting $F_1$ scores of pre-selected description logic concepts.
10: By ranking such concepts in descending order of predicted scores, a possible goal concept can be detected within few retrieval operations, i.e., no excessive exploration.
11: Importantly, top-ranked concepts can be used to
12: start the search procedure of state-of-the-art symbolic models in multiple advantageous regions of a concept space, rather than starting it in the most general concept $\top$.
13: Our experiments on 5 benchmark datasets with 770 learning problems firmly suggest that NERO significantly (p-value $<1\%$) outperforms the state-of-the-art models in terms of $F_1$ score, the number of explored concepts, and the total runtime.
14: We provide an open-source implementation of our approach.\footnote{\raggedright\url{https://github.com/dice-group/Nero}}
15: \end{abstract}
16: