Large Language Model and its Applications in the field of Architecture: A Survey
In recent years, large language models represented by ChatGPT and GPT-4 have made rapid technological progress and iteration, becoming the most revolutionary technology in the field of Artificial Intelligence. Large language models have made key breakthroughs in data information capacity, model parameter quantity, underlying model structure, and model training methods compared to previous language models. Their performance in tasks such as natural language processing, machine vision, and even general tasks continues to improve, including the emergence ability demonstrated by large language models. An overview of the technological evolution, architecture, key technologies, and main characteristics of large language models are provided in the paper. The basic architecture and core principles of large-scale models are introduced, their applications in the field of architecture are shared, their limitations, and future development directions are discussed. The aim is to promote the application and development of Artificial Intelligence technology represented by large language models in the field of architecture and civil engineering.
large language modelsemergent abilitiesadaptation tuningalignmentlarge model of architecture