4.1. Text Sentiment Analysis Technology
For the premise of sustainable reuse of URs, the importance of URs must be considered with the residents as the center, to meet the requirements of residents. Their strong support is needed in the process of sustainable reuse. The DL algorithm can effectively obtain the unstructured text information in social media comments, and achieve the text classification requirements such as the sentiments’ polarity judgment [
31]. In the first stage, the text sentiment analysis technology based on the DL algorithm provides a solution for the quantitative analysis of unstructured and semi-structured network comment data, breaking through the limitations of information sources, data volume, authenticity, and other aspects of the traditional quantitative analysis methods such as questionnaires. It provides decision support for the design strategy of the project of sustainable reuse of urban ruins in the later period. In this stage, social media comments are used as the data source, and a deep learning algorithm is used as the support to measure the emotional orientation of residents in the sustainable reuse of urban ruins. The research steps follow.
Data collection and cleaning: with the five Chinese network platforms of Ctrip, Mafengwo, Meituan Peripheral Tour, Qunar.com, and Sina Weibo as the data sources, we use python’s BeautifulSoup, URL-lib, re regular expressions, and sqlite3 to achieve crawling, parsing, and storage of the text comment data, we find one of the time nodes, obtain the text comment dataset, and then conduct data cleaning to ensure the quality of later data research. Firstly, we eliminate the advertisement and other types of data that are completely irrelevant to the required data in the comment data set. Secondly, we delete the data published on an official account that cannot represent personal sentiments or tendencies. Following this, we de-duplicate the duplicate data in the dataset to eliminate some empty text data. Finally, we get the data after the cleaning.
Data annotation and classification: by collecting and cleaning the residents’ comments in the Social big data, and integrating the content and elements of URs that the residents pay the most attention to, the review content on the reuse direction of URs is divided into six categories: urban parks, museums, gymnasiums, cultural and creative parks, residential areas, and shopping malls. Text classification can be divided into two categories: multicategory and multi-label categories. Based on the EASYDL [
32] platform under the PaddlePaddle [
33] DL framework, this study uses a multi-label text classification model to annotate and classify the text comment data of evaluation categories to evaluate the sentiment tendencies of different content layers. The specific implementation methods are as follows: first, a total of 10,913 pieces of data was obtained through the crawler, and a total of 2906 pieces of valid data was obtained after data pre-processing. Ninety percent of the total dataset was randomly selected as the training set, i.e., 2616 input models for training. Five percent was used as the development set, i.e., 145 pieces of data were used for cross-validation and parameter tuning during the model training. The number of test sets was 5%, and 145 pieces of data were tested on the test set after the model is trained. The multi-label text classification dataset is then created on the EASYDL platform, and the training patterns are manually labeled. Second, the model training is performed for multi-label text classification data, and the model evaluation report and verification results are optimized step by step; finally, after the model training is completed, the model is invoked via the Restful API interface, and the patterns to be classified are classified [
34]. The text classification model often takes the accuracy rate
, the recall rate
, and the
value as the evaluation indicators. The accuracy rate is based on the prediction results, which means the actual positive proportion of the samples with a positive prediction: the higher the accuracy rate, the more accurate the model classification. The recall rate is specific to the sample, that is, the number of positive samples in the sample that are correctly predicted. The higher the re-call rate, the wider the coverage of the model, and the less text is missing. The
value is a comprehensive indicator that reflects the model’s classification performance. It is the harmonic mean of the accuracy rate and recall rate. The calculation formula of the evaluation index is as follows:
The accuracy rate, recall rate, and
value of the macro average are used to evaluate the multi-label text classification model. The calculation formula is as follows:
The final Macro-average accuracy rate P is 96.5% by the text classification model. The recall rate was 84.0%, and was 89.5%. This meets the requirements of an affective propensity analysis for the sample data that have been classified in the follow-up of this study.
The sentiment polarity is judged by the data obtained. The DL algorithm is widely used in the field of natural language processing, and has achieved better results than traditional models. Common deep learning algorithms mainly include Convolutional Neural Networks (CNN) [
35], the Long-Term Memory Model (LSTM) [
36], the Bidirectional LSTM Structure Model (BI-LSTM) [
37], and the Baidu Text Semantic Representation Model (ERNIE2.0) [
38]. The review data of the URs are mainly unstructured short texts. Most of the review texts contain the feature words that determine the overall emotional orientation of the text. Using ERNIE2.0, accurate emotional polarity can be determined. By calling the API interface, we can judge the emotional polarity category of the URs’ review data. At the same time, the bidirectional encoder representations from transformers’ BERT text analysis model [
39] has been widely used in recent years. The core idea of the BERT model is consistent with the Transformer model. By combining the relationship between each word in the text and other words, the restriction of distance is removed, and the dependency between the current word and other words in the sentence is explicitly expressed: the process is to fully combine the context information of the sentence, which better identifies the semantic information of the sentence, and also achieves the purpose of parallel processing. Its network structure is shown in
Figure 6a. The coding process of the model input is the sum of three vectors. The input form is shown in
Figure 6b, for the word vector representation, the location information coding, and the paragraph information-marking of each word in the input text, respectively. At the same time, two special symbols [CLS] and [SEP] are added; [CLS] is generally added at the beginning of the text. This feature can be extracted for classification models, and [SEP] represents the sentence-breaking symbol, which is used to break two sentences in the input text. Emotional polarity is divided into three categories: negative (0~0.333), neutral (0.333~0.667), and positive (0.667~1.000), with the corresponding confidence added. By using the deep learning algorithm to calculate the emotional value of the comment data, a lot of manual interpretation time can be saved, and it is universal.
4.2. GAN Technology
Through the analysis of the first stage, we can fully understand the tendency of the surrounding residents to the type of sustainable reuse of URs, which is helpful to meet the needs of the residents to the greatest extent after the reuse of URs, so that the URs can be truly activated. Using the GAN approach, the current situation analysis of UR reuse in the second stage is proposed for the renovation strategy based on the results obtained in the first stage, including site information extraction, assessment of the physical properties of the built environment, and design of the renovation plan. Since GAN was first proposed in 2014, the design field has gradually selected GAN to develop new design tools. Its various improved models show the unique advantages of confrontation training methods, which have been widely used in scenes such as picture style migration and picture repair. The emergence of GAN can add competence to some simple and repetitive work such as image recognition and classification, and target detection, etc. It can efficiently process multi-source site information data, sort out the information needed for site analysis, and gradually liberate productivity. Before starting the specific design work, GAN can be used to identify and classify the texture of forest land, construction land, farmland, water areas, mining land, roads, buildings, and other different land uses around the URs according to different classification targets and objects. Furthermore, it can be combined with remote sensing data to extract particular environmental information about the site, such as water turbidity, water body attributes, surface biomass, surface temperature and other climate attributes, and conduct deeper research and improve the ecological optimization of microsites. The assessment of a building’s environmental performance includes the damage degree of the building itself and the surrounding water, energy consumption, and waste generation. The AI is embedded into the possibility of environmental material performance assessment and the optimization at the initial stage of design, and the GAN is proposed in order to combine with environmental performance prediction and form an environmentally intelligent assessment method. Before the specific design scheme is carried out, the ML will be used at this stage to predict the load of the building equipment system and the building energy consumption. The real sample set of building energy consumption will be ex-pressed as , where x represents the input of the neural network, that is, the building energy consumption at the previous moments, and y represents the output value of the neural network, that is, the predicted energy consumption at the next moment.
As a generative model, GAN is seen as a minimal-maximization game whose objective is to improve the generating power of the generator model
and the discriminative power of the discriminator model
to reach a Nash equilibrium. The specific formula of GAN is shown in Formula (4):
Maintaining consistency with the whole sample set of the building energy consumption, the building energy consumption sample generated by GAN can be expressed as:
where
represents the previous moment’s produced building energy consumption, whereas
represents the following moment’s expected building energy consumption. Together with the created sample set of the building energy consumption, the actual sample set of the building energy consumption is utilized for training the neural network, accelerating the convergence of the neural network, and optimizing the GAN model using the regularization approach. When the GAN parameters continue to change,
and
yz are continually updated based on the actual energy consumption sample set, and
V approaches a global minimum. The design choice made at the earliest stage of the building design has a significant influence on the environmental performance of the design. In contrast, only a minor cost of design modification is incurred. At this stage, it is also necessary to fully consider the availability of building debris materials, and minimize the generation of construction waste. In the specific scheme design stage, several representative design cases are selected for analysis and reference. With GAN, we can quickly understand the area proportion of various land types of the case, to assist the design and quantitatively analyze the space allocation of the actual project according to the scale of the case, to improve the design efficiency. The model may be used to increase efficiency and reduce drawing time during the design process, particularly when there are a high number of solutions and different solutions. During the design process, we may also evaluate a number of design circumstances, include a number of components, and provide a number of solutions. The design is then coupled with geographical information to summarize public perception and assessment patterns of design solutions. In the scheme design stage, especially in the working scenario of numerous scheme analyses and multi-scheme mapping, the use of this model will help to improve work efficiency and save mapping time. In the process of designing the scheme, we can also weigh various design conditions, synthesize various factors, and obtain a variety of schemes. Furthermore, we can design questionnaires and send them to nearby residents for scoring in combination with the images of the design scheme. By combining spatial information with information on the evaluation of urban ruin reuse design plans by residents, we can effectively guide the planning and design from the perspective of the users’ perceptions by making use of the laws of ML’s perception and evaluation of the design plans.
4.3. Smart Building and Landscape Design
At the same time, in the future knowledge economy society, important production sites will be transformed into intelligent buildings and landscapes. In the future, with the development of architecture and landscape science and technology, we will further focus on protecting the environment, saving resources, reducing energy consumption, and improving the production and living conditions of human society. We will strive to develop and apply high-tech methods and build intelligent, energy-saving, ecological, and other new buildings, and landscapes, and fully meet the needs of society. Intelligent buildings and landscapes are the combination of information technology and architectural technology. The last stage of the sustainable reuse of URs is the specific project implementation stage, which is divided into three dimensions: intelligent architecture and landscape, intelligent service, and intelligent management. The specific project implementation forms a sound operation and ecological monitoring system. At this stage, it is also necessary to focus on the overall analysis of existing buildings, including the residual technical performance and the degree of material degradation, in order to maintain and renovate existing materials as much as possible, minimize the impact of the demolition of old buildings and new construction waste on the environment, and try to use energy-saving and environment-friendly materials in the new buildings.