abstract:c6b978667f5247e2.tex

1: \begin{abstract}

2: Concept learning deals with learning description logic concepts from a background knowledge and input examples.

3: The goal is to learn a concept that covers all positive examples, while not covering any negative examples.

4: This non-trivial task is often formulated as a search problem within an infinite quasi-ordered concept space.

5: Although state-of-the-art models have been successfully applied to tackle this problem, their large-scale applications have been severely hindered due to their excessive exploration incurring impractical runtimes.

6: Here, we propose a remedy for this limitation.

7: We reformulate the learning problem as a multi-label classification problem and propose a neural embedding model (NERO) that learns permutation-invariant embeddings

8: for sets of examples tailored towards

9: predicting $F_1$ scores of pre-selected description logic concepts.

10: By ranking such concepts in descending order of predicted scores, a possible goal concept can be detected within few retrieval operations, i.e., no excessive exploration.

11: Importantly, top-ranked concepts can be used to

12: start the search procedure of state-of-the-art symbolic models in multiple advantageous regions of a concept space, rather than starting it in the most general concept $\top$.

13: Our experiments on 5 benchmark datasets with 770 learning problems firmly suggest that NERO significantly (p-value $<1\%$) outperforms the state-of-the-art models in terms of $F_1$ score, the number of explored concepts, and the total runtime.

14: We provide an open-source implementation of our approach.\footnote{\raggedright\url{https://github.com/dice-group/Nero}}

15: \end{abstract}

16: