Philipp-Lorenz Glaser
Univ.Ass. Dipl.-Ing. BSc
Philipp-Lorenz Glaser
- Email: philipp-lorenz.glaser@tuwien.ac.at
- Phone:
- Office: FB0103 (1040 Wien, Erzherzog-Johann-Platz 1)
- About:
- Orcid:
- Keywords:
- Roles: PreDoc Researcher
Publications
The extended EA ModelSet—a FAIR dataset for researching and reasoning enterprise architecture modeling practices
Philipp-Lorenz Glaser
Emanuel SallingerKeywords: ArchiMate, Artificial intelligence, Conceptual modeling, Dataset, Enterprise architecture, Enterprise modeling, FAIR, Machine learning
Astract: Conceptual modeling research is increasingly investigating the application of artificial intelligence (AI) and machine learning (ML) to automate tasks like model creation, completion, analysis, and processing. This trend also applies to enterprise architecture (EA) research. In contrast to its neighboring disciplines, such as business process management, EA lacks proper guidelines, patterns, and best practices to create high-quality EA models. A currently limiting factor for conducting AI-based research to bridge these gaps is the scarcity of openly available models of adequate quality and quantity. With this paper, our aim is to address this limitation by introducing the extended EA ModelSet, a curated and FAIR repository of enterprise architecture models represented in the ArchiMate modeling language that can be used by the research and practitioner community. We report on our efforts to build the EA ModelSet and elaborate on exemplary future empirical and ML-based research that can facilitate the dataset. We hope that this paper sparks a community effort toward the further development and maintenance of the EA ModelSet.
Glaser, P.-L., Sallinger, E., & Bork, D. (2025). The extended EA ModelSet—a FAIR dataset for researching and reasoning enterprise architecture modeling practices. Software and Systems Modeling, Article 111431. https://doi.org/10.1007/s10270-025-01278-1
Encoding semantic information in conceptual models for machine learning applications
Philipp-Lorenz GlaserKeywords: conceptual modeling, encoding, machine learning
Astract: The integration of Conceptual Modeling (CM) and Machine Learning (ML) has given rise to a growing research field known as Machine Learning for Conceptual Modeling (ML4CM), where ML techniques are applied to support modeling tasks such as classifica-tion, completion, or repair. A crucial factor in these applications is the transformation of conceptual models into ML-compatible representations, called encodings. A wide variety of encoding strategies exist that draw on different information sources within conceptual models, depending on the specific use case. However, existing ML4CM studies tend to treat encodings as fixed and focus predominantly on tuning ML algorithms or hyperparameters. Consequently, encoding strategies and their internal configuration options receive limited scrutiny during evaluation, making it difficult for researchers and practitioners to select and adapt optimal encodings for specific tasks.This thesis addresses this gap by developing and evaluating a set of configurable semantic encodings for conceptual models. Specifically, it investigates how semantic information (e.g. names, types, contextualrelationships) within models can be systematically extracted and transformed into ML-compatible representations. The work adopts the Design Science Research methodology and extends the CM2ML framework with an ArchiMate parser and four semantic encoders: Bag-of-Words (BoW), Term Frequency (TF), Embeddings,and Triples. Each encoder captures distinct semantic aspects and supports extensive configurability to enable experimentation and task-specific adaptation. Furthermore, all encodings can be interactively visualized within the framework, offering real-time insight into parameter effects and traceability to link encoded features back to their source model elements.To evaluate the proposed encodings, the thesis combines a qualitative comparison based on defined criteria with a quantitative assessment through two representative ML tasks.The first task, dummy classification, employs TF encodings to distinguish dummy views from valid ones and explores the impact of common NLP parameters and weighting schemes. The second task, node classification, aims to predict element types based on local context, using triple encodings enriched with word embeddings for element names and one-hot vectors for types. The results demonstrate the suitability of the encodings for specific ML4CM tasks and that certain encoding configurations can have a substantial influence on model performance.
Glaser, P.-L. (2025). Encoding semantic information in conceptual models for machine learning applications [Diploma Thesis, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2025.119285
Teaching
Advanced Model Engineering
Semester: 2026S; Nr: 194.195; Type: VU; Hours: 4.0; Language: English; View on TISSModel Engineering
Semester: 2025W; Nr: 188.923; Type: VU; Hours: 4.0; Language: English; View on TISSTeam
Business Informatics Group, TU Wien
Professors
Christian Huemer
Ao.Univ.Prof. Mag.rer.soc.oec.Dr.rer.soc.oec.
Dominik Bork
Associate Prof. Dipl.-Wirtsch.Inf.Univ.Dr.rer.pol.
Gerti Kappel
O.Univ.Prof.in Dipl.-Ing.inMag.a Dr.in techn.
Henderik Proper
Univ.Prof. PhDResearchers
Aleksandar Gavric
Univ.Ass. M.Eng. M.Sc. B.Eng.Charlotte Roos R. Verbruggen
Univ.Ass. PhD
Marco Huymajer
Senior Lecturer Dipl.-Ing. BSc
Marianne Schnellmann
Univ.Ass. MScMarion Murzek
Senior Lecturer Mag.a rer.soc.oec.Dr.in rer.soc.oec.
Marion Scholz
Senior Lecturer Dipl.-Ing.inMag.a rer.soc.oec.
Miki Zehetner
Univ.Ass. DI Bakk.rer.soc.oec. MSc




