SKH-NLP at LLMs4OL 2024 Task B: Taxonomy Discovery in Ontologies Using BERT and LLaMA 3

Authors

DOI:

https://doi.org/10.52825/ocp.v4i.2483

Keywords:

Large Language Models, LLMs, Ontologies, Ontology Learning, Fine-tuning, Prompting, Prompt-based Learning, BERT, LLaMA 3

Abstract

Taxonomy discovery in ontologies refers to extracting the parent class from the child class. By modeling this task as a classification problem, we addressed it using two different approaches. The first approach involved fine-tuning the “BERT-Large” model with various prompts and using it in a classification system. In the second approach, we utilized the “LLaMA 3 70B” model, experimenting with different prompts and modifying them to achieve the best results. Additionally, we evaluated the correctness of the answers using substring and Levenshtein distance functions. The results indicate that, with appropriate fine-tuning, the BERT model can achieve performance levels comparable to those of more recent and significantly larger language models, such as LLaMA 3 70B. However, with appropriate prompts, LLaMA 3 70B performs slightly better than BERT, highlighting the importance of prompt quality. Ultimately, further experiments on different settings for fine-tuning BERT, few-shot learning, and using knowledge graphs for validating the model's answers for LLaMA are recommended to improve the results. Additionally, testing other models and examining the results of various encoder-based and decoder-based models can be employed.

Downloads

Download data is not yet available.

References

[1] H. Babaei Giglou, J. D’Souza, and S. Auer, “Llms4ol: Large language models for ontology learning,” in The Semantic Web – ISWC 2023, T. R. Payne, V. Presutti, G. Qi, et al., Eds., Cham: Springer Nature Switzerland, 2023, pp. 408–427, ISBN : 978-3-031-47240-4.

[2] H. Babaei Giglou, J. D’Souza, and S. Auer, “Llms4ol 2024 overview: The 1st large language models for ontology learning challenge,” Open Conference Proceedings, vol. 4, Oct. 2024.

[3] H. Babaei Giglou, J. D’Souza, S. Sadruddin, and S. Auer, “Llms4ol 2024 datasets: Toward ontology learning with large language models,” Open Conference Proceedings, vol. 4, Oct. 2024.

[4] “Geonames.” (n.d.), [Online]. Available: https://www.geonames .org/ (visited on 08/05/2024).

[5] V. Levenshtein, “Binary codes capable of correcting deletions, insertions, and reversals,” Proceedings of the Soviet physics doklady, 1966.

Downloads

Published

2024-10-02

How to Cite

Hashemi, S. M. H., Karimi Manesh, M., & Shamsfard, M. (2024). SKH-NLP at LLMs4OL 2024 Task B: Taxonomy Discovery in Ontologies Using BERT and LLaMA 3. Open Conference Proceedings, 4, 103–111. https://doi.org/10.52825/ocp.v4i.2483

Conference Proceedings Volume

Section

LLMs4OL 2024 Task Participant Papers