1. Introduction
Trading is one of the most primitive and fundamental social relations in human society. In the contemporary cities, retail is a fundamental activity, either from an economic, social, or spatial point of view. The spatial structure of cities is closely linked to the location and spatial pattern of retail establishments. The importance of this activity is highlighted in the literature as attracting movement, creating centralities, as well as reinforcing the revitalization of urban areas.
Retail location is a central topic in market geography. Once a retailer has decided on where to put a store, and how big to build it, he has well set at least the range of its future profits [
1]. According to Clarke, the location decision is the most important one, because the retailer must live with it and handle it continually to ensure profitability [
2]. Many studies seek to identify the logic of the spatial location and distribution of trade in cities. Among these approaches are the gravitational models, pioneered by William J. Reilly, whose work was published in 1931. This author associated Newton’s universal law of gravitation to what he called attraction force between two retail centers and their potential clients. Later, David Huff improved this idea, including in his equations the attractive potential of several establishments competing by customer preference. Over time, studies have been seeking to explore the importance of several attributes that hold influence in consumer’s choice [
3]. Recently, classic models as Huff have been integrated to the computational environment through the Geographic Information Systems (GIS), associated with a greater availability of georeferenced urban data.
This paper adopts a retail gravitational approach (Huff model) exploring a methodology for retail spatial analysis in a GIS environment. The main objectives are: (a) discussing the influence of different store location attributes in consumer choice among a group of retail offers, and (b) developing an empirical application for the clothing retail business sector in the city of Santa Maria, RS, Brazil. In this study, we will emphasize urban location attributes rather than store attributes, focusing on three aspects of urban form: store clustering, accessibility, and topographic slope. The software ArcGIS® was used as the integrating environment for different analysis tools.
This paper is aiming to enrich the traditional modelling of retail attractiveness by introducing more qualitative descriptions of retail location: location is not only about the distance to the consumers, but also to some urban form attributes, such as street grid connectivity, relative position, terrain slope, spatial agglomeration, etc.
The remainder of this paper is organized as follows. First, the theoretical background of the paper is presented, focusing on the issue of retail location.
Section 3 defines the methodology of the study, including the process of modelling, selected variables, and empirical data.
Section 4 presents the results and discussion of the empirical application in Santa Maria. The final section concludes the main findings and the limitations and potentialities of the methodology, and possible future studies in this topic.
3. Methods
GISis on the rise for its ability to cross data from different sources and reveal underlying spatial patterns. Fast and low-cost computers associated with the growing availability of urban data and information allow for the application of more detailed gravitational models, exploring the GIS functionalities to operate with data in different geographic scales [
24]. Indeed, the advances in technology lead to an increase of functionalities that GIS can now perform, which allow new approaches in urban studies [
25]. This paper proposes an application of the Huff model included in the ArcGIS
® 10.3.1 package (ArcGIS Market Analysis Tools. ESRI—Environmental Systems Research Institute: Redlands, CA, USA. Version 10.3.1 released on 2015.), aiming to discuss the influence of different store location attributes on consumer choice among a collection of retail offers. As mentioned before, we will focus on the location attributes rather than the store attributes. We do not intend to dismiss the great relevance of store attributes (store size, parking spaces, variety of goods, etc.) in consumer choice, but to emphasize the urban form attributes of the location. To do this, we will take the hypothesis that all stores have a uniform retail profile, in other words, the attraction of each store will be weighted in the Huff model by its location attributes.
To run the empirical application, we selected three main features of urban form related to retail location and defined their respective attributes. The selected features were: the agglomeration pattern of the stores (level of spatial concentration), the accessibility (topological) and, as an environmental feature, the slope of the terrain. We took these parameters as measures of attractiveness in Huff model to estimate the patronage probabilities of each store (location), and then summarized how the total population would be shared between stores. The Huff model calculates this market share by applying probabilities to the population data for all origins.
The empirical study area was Santa Maria, Brazil (
Figure 1). The total urban population of Santa Maria, according to the Santa Maria Institute of Planning based on 2010 Census, was 248,347, 95.1% of the total. The figure emphasizes the location of the city in a national and regional context, and highlights the Downtown area, where 52.4% of the stores analyzed are located. The Downtown area of Santa Maria concentrates the most important commercial streets since 1885, when a railway was built in its vicinity, giving the city the title of “railway city”, and strengthening retail and local services.
Database for the retail stores was obtained from CNEFE (National Register of Addresses for Statistical Purposes) for the year of 2010, provided by IBGE (Brazilian Institute of Geography and Statistics). We adopted the method proposed by [
26] to spatialize data from CNEFE in a GIS environment. According to this database, Santa Maria had, in 2010, a total of 291 clothing stores, but only 245 could be successfully geocoded (84.2%) due to address information problems. The dataset was corrected with Google Earth
® (Google Earth Pro. Google: Mountain View, CA, USA. Version 7.3 released on 2017) images, with the addition of five stores, resulting in a total of 250 stores mapped.
The level of spatial concentration of stores was estimated through Kernel density, a method that creates a continuous field through discrete objects to analyze data concentration [
27]. The terrain slope in degrees was processed in a Digital Elevation Model (SRTM) made available by NASA with a spatial resolution of 30 m.
The spatial representation of Santa Maria street grid was provided by the Department of Architecture and Urbanism of the Federal University of Santa Maria. We performed street grid corrections to fix small inaccuracies. The dataset was based on “street segments”, the geographic space located between two corners due to its high level of detail, considering the unequal distribution of retail shops along the streets. This spatial basis contained 7101 street segments, and this was entered as an input to calculate local accessibility in the software UrbanMetrics
® (UrbanMetrics. Federal University of Pelotas: Pelotas, Brazil. Version 2.2 released on 2016.). Accessibility is a measure of relative distance (topological) from each space to all others in the same spatial system, and we calculated this measure in a local scale, radius 3, i.e., a distance considering three topological steps from each space [
28,
29]. All three parameters (Kernel density, terrain slope and local accessibility) were rescaled to present a range from 1 to 100, in order to allow comparisons.
Table 1 summarizes the study methodology.
The ArcGIS software was the integrating environment for the different tools and data, and for the application of the Huff model. To run the Huff model, we took data on the total population by census tracts (centroids) as origins and all the 250 stores as destinations. Only the urban area of the municipality was included in the study. Using the centroid of the census tracks as an approximation of population location can be problematic, since these geographic areas have different sizes and shapes. Due to the lack of more detailed data on the population spatial distribution, many market area analyses adopt census tracts (or blockgroups) centroids to estimate the distances to stores. In this paper, we only considered the population of the urban area of the city, which represent 96.79% of the total population according to the 2010 Census. In the urban area of Santa Maria, 93.62% of the census tracts had a surface that was smaller than 2 km2. In this sense, this approximation is considered to be appropriate for this experiment.
The model was able to estimate the population share for each store (location), considering each selected attractive factor. According to the literature, model calibration should be based in actual shopping preferences for products, surveyed at the household level. Actual choice behavior (the variable to be predicted) must be obtained empirically from residents to geographic subareas located within a larger study area, and choices must be obtained for all alternatives considered by these residents [
30]. Unfortunately, these data were not available for Santa Maria, being rare in the Brazilian context. According to the Huff model, the exponent for distance is assumed to be negative. Convenience products could be expected to have a larger exponent, while specialty goods would have a much smaller exponent [
30]. Clothing products can be considered as comparison or specialty goods. After a few tests, we adopted the distance constraint parameter (
β) as 1 and Euclidian distances. We performed three separated runs of the model, each run was loaded with one of the three location-attractive factors.
4. Results
4.1. Results for Locational Attributes
The map shown in
Figure 2 revealed a strong concentration of clothing stores in the Downtown area, as expected. Two other concentrations were noticeable at the east and west sectors of the city. We noticed that the cluster of the eastern portion of the city seemed to not be as significant as the west portion, likely because of its late history of urbanization: while the city center and its surroundings exhibit urban expansion from 1902 to about 1933, the East portion began its development in the early 1980s, after the implantation of the largest university of the city in 1960, the inauguration of a base of the Brazilian Air Force in 1970, the incorporation of a district and its transformation into neighborhood in 1982, and finally with the consolidation of a major housing estate in the late 1980s. Meanwhile, in the 1970s, the west portion of the city was already densified, and it represented a consolidated core, enhanced also by the implantation of a housing estate in the late 1980s, and the occurrence of the largest irregular occupation process in the city in the year of 1992 [
31].
The frequency histogram in
Figure 2 reveals a “u” form, which is interesting because it demonstrates that a significant number of stores were in clusters, and another meaningful amount was isolated. For example, 53.2% of all stores were in the more dispersed classes of values, while 23.60% were in the three highest cluster areas.
For local accessibility,
Figure 3 indicates a strong correlation between the most accessible street segments and the retail locations. Of all stores, 71.2% were located on the most accessible routes in the system (above 16.68), while only 1.6% were in the lesser accessible paths. As mentioned in the Methods section, accessibility, or closeness centrality [
13], is a network metric, referring to the relative distance of each node (segment) in the spatial system. In this sense, this property must be seen as an indicator of accessibility, and, according to the literature, related to retail location.
Finally, the terrain slope analysis (
Figure 4) demonstrates the preference of stores to allocate in streets with lower slope degrees. 62.4% of all stores were in slopes that were smaller than 4.56 percent. In this dataset, only 2% of all stores were located in areas with a slope degree that was higher than 9.11 percent. With these results in hand, we set out to apply the Huff model.
4.2. The Huff Model
As mentioned before, we used the centroid of the census tracks as origins for the Huff model, considering data on total population.
Figure 5 is a map of population by census track for elucidation reasons. The clipping of the urbanized area was provided by the Santa Maria Institute of Planning, and it may be important to highlight that it is slightly different from the Brazilian Institute of Geography and Statistics, which provided the Census data for this map.
The figure shows a high density of residential population in the Downtown area and surroundings, as well as two clusters in the east and west regions. We can also notice that a smaller concentration started to grow on the South area.
Figure 6,
Figure 7 and
Figure 8 shows the results of the probability of each store being chosen based on each of the selected parameters.
When analyzing the probability of choice for the store clustering as an attraction variable, we can clearly see that the East cluster lost strength, likely because of its detachment from the city core and the peculiar aspects of the region that we have discussed in
Section 4.1. Meanwhile, the stores located in the center and west parts presented the highest probabilities of being chosen.
Meanwhile, the accessibility analysis for the Huff model had its own particularities. The results were influenced by the local nature of the phenomena. Nevertheless, the range of the probabilities was narrower, as we will discuss later.
Finally, the probability of choice for the terrain slope as an attraction variable showed interesting results. The whole sphere of probabilities seemed to have diminished throughout the city. As we already discussed, this variable was the one with a minimum range of original values, so we can say that it had the most homogeneous behavior among the selected parameters. The Huff model tries to divide the probabilities of choice, but since all units have similar values, the overall result is homogeneous as well.
The impressions that we just discussed are endorsed by
Figure 9, which shows the probability of each store being selected.
The Store Clustering variable had a high frequency at both ends of the graph, suggesting a strong tendency to influence consumer preference when it showed strong agglomeration (positive influence) or store dispersion (negative influence). This interpretation was supported by the analysis of the histograms of the values of each individual variable (
Figure 2,
Figure 3 and
Figure 4), and of the dispersion measures (amplitude and standard deviation) illustrated in
Table 2.
Table 2 reveals, first, that the most homogenous result was that of terrain slope, which indicated a very consistent preference for less steep locations. The range of probability of a store being chosen when considering slope as an attractor variable was the smallest one for the set, followed by local accessibility and store clustering, in that order. The same occurred for standard deviation.
Finally,
Table 3 shows the numeric results for the Huff model estimated population share.
We can observe that, while the parameters of local accessibility and terrain slope presented similar behaviors—with a difference of 8.47% for the best performance, and 7.21% for the worst—, we cannot say the same about store clustering. The top 10 stores with the best performances when considering only store clustering as an attraction variable couldattract 31.20% more people than when local accessibility was considered as the attraction variable, and 37.02% more than the terrain slope. The significance of this parameter was also suitable for the negative effects. The difference of the attracted population was of a negative degree of about 94%, when comparing with both the local accessibility and the terrain slope.
It is important to emphasize that the approach we adopted, due to the lack of actual consumer patronage data, represents purely a modelling output of potential attraction based on the selected urban criteria.
5. Final Remarks
This paper sought to discuss the influence of different store location attributes on the consumer choice. As methodological approach, we selected three location attributes—spatial agglomeration, local accessibility, and terrain slope—as measures of attractiveness in the Huff model to estimate the patronage probabilities of each store (location).
Regarding these location attributes, the study was able to show different behaviors of the clothing stores in Santa Maria. Our results reveal that most stores tend to choose locations with similar profiles, i.e., flatter and accessible to their neighborhoods, confirming literature on retail preferences [
15,
16]. We also identified only three significant retail clothing agglomerations in Santa Maria, and half the stores were relatively dispersed in the urban areas.
The model results on attractiveness allowed for the identification of the likely performances of each store location attribute and their spatial hierarchy. The stores could be ranked by their probabilities of being chosen, the amount of attracted population, and their statistical profiles, for each location attribute. Our results indicate that, for this dataset, the agglomeration attribute is by far the one that has more influence in the probability to attract customers. Locations with high values of clustering had an outstanding performance compared to the other studied attributes, while the ones with low values had the worst performance.
It is relevant to mention that unplanned retail agglomeration is a bottom-up urban attribute that emerges through decentralized decisions of many retailers taking location decisions to maximize their profits. Retail agglomeration tends to reduce consumer search costs, creating a critical mass of retail space for comparison shopping [
32]. In areas of retail agglomeration, land prices tend to be higher, selecting for retailers that able to afford this cost. In this sense, store agglomeration tends to be a spare attribute, related to urban centralities and hierarchy.
In this paper we explored the Huff model to discuss urban form attributes of store locations. This application differs from the traditional use of the model, in which the attractiveness variable generally refers to the store itself, and the urban attributes appear as distance or travel time. The application allowed for different scenarios of probabilities for consumer patronage to be built, and discussion of the performance of selected urban form parameters.
In order to implement the application, some simplifications were adopted. We relied on a preliminary calibration of the Huff model based on theoretical assumptions, since no empirical research on consumer patronage was available. We must say that the variables used in this work were adequate, but future studies can enrich the methodology with the addition of other variables described in the literature as attractive factors.
Lastly, the task of urban planning in managing the relationship between retail and urban spaces is quite complex. Studies of the kind presented here can contribute to a better understanding of that relationship.