Team member(s): Bao Trinh and Eva Papaspyrou
Modified by Bao Trinh on September 30, 2024

Introduction
Methods
- Graph database
- Designing the building graph
- Extracting the building graph
  1. Architectural solid component classification
  2. Architectural space massing classification
- Fine-tuning LMMs for graph knowledge QA
- Deploying an application
Conclusion
Future Work

Abstract

The architectural design process often suffers from a disconnect between early-stage conceptual 3D models and the rich building knowledge crucial for informed decision-making. Traditional representations like brep, voxels, meshes, and point clouds, while powerful, can be computationally intensive or lack semantic flexibility. This research explores a graph-based approach to unify 3D building models and their associated knowledge, enabling a lighter, more flexible, and semantically rich representation. We propose to embed building information within the graph structure, facilitating direct interaction and analysis.
Furthermore, this research investigates the potential of Large Language Models (LLMs) to interact with this 3D knowledge graph, providing architects with a natural language interface for querying and manipulating building information. This could streamline the design process, enabling rapid exploration of design alternatives and fostering a deeper understanding of the building’s essence from the outset.
While challenges remain in effective graph construction, knowledge mapping, and algorithm development, this integrated approach promises to bridge the gap between conceptual design and building information, ultimately empowering architects to create buildings that are not only aesthetically pleasing but also functionally sound and contextually responsive.

Hypothesis

Different types of neural networks can help to extract graphs from unlabeled conceptual 3D models by performing classification tasks.
Fine-tuning the base large language model responsible for generating cypher queries will lead to a significant improvement in the accuracy of responses generated by the question-answering system, especially in the custom graph schema.

Objective

Develop a framework including 3D object detection methods to construct a knowledge graph of an apartment building in the conceptual stage.
Exploring the potential of large language models and graph database in interacting with the building’s knowledge graph.
Deploying all the models in one web application to enhance design decision-making.

Overall process

We start with a conceptual solid component, which is essentially a 3D representation of walls, slabs, doors, etc. We then classify the different components within the building. This helps us understand the boundary of spaces and the connection between them. Next, we extract the space massing from the spaces boundaries, which gives us a sense of the building’s volume and layout. We further classify the space massing into different categories, such as tower, podium, or circulation corridor….Finally, we extract the building graph. This is a structured representation of the building’s spaces and their relationships. Now, with our building graph in place, we can leverage the power of fine-tuned Large Language Models (LLMs) for graph querying. This enables us to ask complex questions about the building and get insightful answers. We store all this valuable data in a graph database for easy access and management. Ultimately, this entire process feeds into building graph Q&A applications, which allows us to interact with the building knowledge graph in a natural and intuitive way.

Graph database (Neo4j and Cypher)

Architecture is about organizing spaces, so that graphs offer a flexible and intuitive way to represent the complex relationships inherent in architectural data. We choose Neo4j, a leading graph database, excels in handling this type of information. Its query language, Cypher, allows us to easily retrieve and analyze data within the graph

Designing the building graph

This schema show how we design our building graph. As mentioned before, relationships are important, our graph is structured to be in 3D, incorporating spatial data, to facilitate complex circulation-based queries within the building. Horizontal edges are Corridor, vertical ones are Stair or Elevator with length attribute for time-based queries

Architectural solid component classification

In order to segment the building into distinct spaces, we first need to classify the solid components within the conceptual model. This classification step is essential for accurately defining the boundaries of each space

Dataset creation

The solid component dataset was created with the help of PlanFinder AI generating the 2D apartment floor plans, and some other algorithms in Grasshopper to convert them into 3D

Classes

The classes are divided into 4 main categories: Wall, Slab, Door, Window. But for wall we divided into BoundaryWall and InnerWall to segment spaces bounding. The Door is divided into OtherDoor and DoorToCorridor, which are going be used as a node connecting to the corridor network

Features distribution

Next, we extract the geometrical characteristics of the solid as features to train the classification model. The Length, Height and Depth are from the bounding box. While the NumberOfFaces, Top/SidesArea and ClashCount are extracted directly from the solid objects to ensure that the correlation with those dimensional features are dynamic. The ClashCount is the number of neighbors that clash with the component

Features correlation

This heatmap visualizes that the correlation is as expected. There are no repeated pattern among all rows or columns, which can be great for the model to differentiate labels

Features in pairs

This pair plot shows how pairs of features can be used to differentiate labels, some categories can be done easily, like Slab for example (the brown points)

Features of different door types

But the concern is how can the model recognize different types of door when most feature values are overlapping, as shown in this plot.

Features of different wall types

The same as for BoundaryWall and InnerWall

Artificial neural network for classification

We train those feature with an artificial neural network. After 600 epochs, the accuracy is pretty high. But as suspected, the ANN model fail to differentiate different types within the same category, especially for doors; all the DoorToCorridor labels are mispredicted

Graph approach for neighbor awareness

A graph approach is used for embedding the neighbor awareness into each component. We try to represent the position of the neighbor components by one-hot encoded the dot product of the vector connecting center of the node to center of it’s neighbor. Only the Length is float number

Solid component star graph

This is how the star graph of each component looks like. We expect that when the node knows that there is a large flat object nearby. It may recognize itself as a DoorToCorridor.

Graph neural network for classification

After only 200 epochs of training, one-third compared to the ANN training, the graph neural network ‘s performance on the test set is 100% correct

Deploying in Grasshopper

We then deploy the models in Grasshopper. As you can see, without the context awareness, the ANN model fails to catch the DoorToCorridor.

Architectural space massing classification

From the BoundaryWall and Slab, we extract the unlabeled space massing of different spaces. We need to train a model to categorize those distinct spaces within the building for graph creation.