A Recommendation System in E-Commerce with Profit-Support Fuzzy Association Rule Mining (P-FARM)

Dogan, Onur

doi:10.3390/jtaer18020043

by Onur Dogan ¹,²

¹ Department of Mathematics, University of Padua, 35121 Padua, Italy

² Department of Management Information System, Izmir Bakircay University, 35665 Izmir, Turkey

J. Theor. Appl. Electron. Commer. Res. 2023, 18(2), 831-847; https://doi.org/10.3390/jtaer18020043

Submission received: 19 January 2023 / Revised: 21 March 2023 / Accepted: 24 March 2023 / Published: 6 April 2023

(This article belongs to the Special Issue Digital Resilience and Economic Intelligence in the Post-Pandemic Era)

Abstract:

E-commerce is snowballing with advancements in technology, and as a result, understanding complex transactional data has become increasingly important. To keep customers engaged, e-commerce systems need to have practical product recommendations. Some studies have focused on finding the most frequent items to recommend to customers. However, this approach fails to consider profitability, a crucial aspect for companies. From the researcher’s perspective, this study introduces a novel method called Profit-supported Association Rule Mining with Fuzzy Theory (P-FARM), which goes beyond just recommending frequent items and considers a company’s profit while making product suggestions. P-FARM is an advanced data mining technique that creates association rules by finding the most profitable items in frequent item sets. From the practitioners’ standpoints, this method helps companies make better decisions by providing them with more profitable products with fewer rules. The results of this study show that P-FARM can be a powerful tool for improving e-commerce sales and maximizing profit for businesses.

Keywords:

recommendation systems; fuzzy association rule; profit-support; profit-confidence; e-commerce

1. Introduction

The growing e-commerce market motivates managers to develop technology that helps customers buy the products they need. As the digital economy develops, competition in the business environment becomes more complicated. Therefore, e-commerce companies need support in choosing the right products to draw the attention of online customers [1]. Customers are often confused when deciding which product to purchase because a wide range of products is offered. This purchasing confusion motivates the development of product recommendation systems [2]. Sales transactions are analyzed to understand the purposes of customers’ visits in terms of product preferences. Association rule mining (ARM) is a popular and powerful method for identifying relationships between purchase preferences. It analyzes historical transactions, that is, item sets purchased by customers, and creates association rules among items. Based on these associations, product recommendations can be generated to match customer preferences. Association rule mining methods, such as the Apriori algorithm, work by counting how often items are sold together. However, these algorithms ignore the items’ profit, so they cannot reveal more profitable items that have an infrequent sales volume. Hence, traditional recommendation systems offer less opportunity for profit improvement.

Discovering associations among item sets in large volumes of e-commerce sales is not straightforward because of the data explosion in the big data era [3]. There are many overlapping intersections among datasets, and the boundaries between them are fuzzy. Traditional association rule mining methods cannot overcome the overlapping and boundary problems [4]. Fuzzy-based methods can manage the uncertainty and produce more accurate solutions [5]. Fuzzy logic is a sub-domain of artificial intelligence, also referred to as multi-valued logic. It presents a robust way of describing vagueness. Instead of the two possible values of binary logic (true and false), every statement in fuzzy logic has a membership degree on a continuum between 0 and 1. Therefore, adapting fuzziness to association rule mining becomes more critical for a recommendation system [6,7,8]. This study handles only the sales amount because of product diversity and missing data about product features. However, some product features could also be considered. For instance, assuming an e-commerce company sells books, apart from sales amounts, book type (horror, adventure, autobiography, travel, etc.) and publication year (recent, new, old, etc.) can also be used for fuzzy logic. Because one book can belong to more than one type, and the newness of a book cannot be described strictly, the fuzzy logic approach was chosen to benefit from its fuzziness.

The primary motivation behind this study is to address the limitations of existing methods for product recommendations in e-commerce systems, which consider only the sales frequency rather than the products’ profitability. The goal is to provide a more comprehensive and practical solution for e-commerce companies, as profitability is a crucial aspect of any business. The proposed P-FARM method aims to enhance decision-making and maximize company profits by considering both the frequency of sales and profitability. The study seeks to bridge the gap between existing methods and real-life business requirements and provide a more accurate and valuable product recommendation system for e-commerce companies. From this perspective, this study makes two contributions to the literature. First, it proposes a novel association rule mining method that uses fuzziness to overcome the overlapping and boundary problems found in real-life applications. Second, it considers profit instead of the sales volume used in traditional association rules. The proposed novel method is called profit-support fuzzy association rule mining (P-FARM). It exploits the relationship between the profit and sales amount of each item. Previous studies have mostly ignored profit and considered only the frequency of item sets. However, e-commerce companies should also consider their profits while proposing products to visitors. To this end, the profit values used as support inputs are converted into fuzzy numbers so that they are represented more appropriately, as in real-world implementations.

Association Rule Mining (ARM), Profit-Support Association Rule Mining (P-ARM), and Profit-Supported Fuzzy Association Rule Mining (P-FARM) are three approaches that can be used for generating association rules in the e-commerce domain. ARM is a traditional data mining technique that focuses on identifying frequent itemsets and association rules. It uses support and confidence metrics to identify the most relevant rules. P-ARM is an ARM extension that considers the items’ profit in addition to the support and confidence metrics. P-ARM uses the Profit-Support metric, which is calculated by dividing the total profit of an item by its support value. P-FARM is an improvement of the P-ARM approach that further extends the fuzzy logic theory. P-FARM uses a novel approach to calculate the support of an item that considers both the traditional support and the item’s profit. It then uses a profit potential coefficient to calculate the fuzzy profit support of an item. P-FARM thus provides a more comprehensive way to mine association rules by considering both the frequency and profitability of the items. In summary, ARM focuses on identifying frequent itemsets and association rules based on support and confidence, P-ARM takes into account the profit of the items in addition to the support and confidence, while P-FARM extends the approach further by considering both the traditional support and the profit of the item to calculate the fuzzy profit support.

The paper is organized as follows. Section 2 gives a brief review of recommendation systems in the e-commerce domain. Section 3 presents the background of association rules and fuzzy association rules. Section 4 introduces the proposed methodology, profit-support fuzzy association rule mining (P-FARM). Section 5 shows the implementation of the proposed method to verify its validity. Section 6 compares the effect of the proposed P-FARM and P-ARM with a numerical experiment. In Section 7, the study is concluded by giving highlights and limitations.

2. Literature Review

The most popular use of association rules is analyzing customer transaction data to identify relations between purchased products. The main aim of association rules mining is to support sales. Ref. [9] introduced an effective method to create important association rules between products purchased. Various algorithms were improved to find association rules in large databases, such as the AprioriHybrid algorithm [10], direct hashing and pruning algorithm [11], frequent pattern-growth algorithm [12], cluster-based association rule algorithm [13], integrating web traversal patterns and association rules [14], and matrix and interestingness-based association rule mining [15]. Table 1 summarizes previous studies about association rules.

Since the data type used may affect the data preparation and the methods to be applied, researchers have mined association rules from different data types. Some frequent pattern algorithms have been introduced to extract information from streaming data. These algorithms involve significant data mining techniques such as clustering [16,17], classification [18], prediction [19], frequent pattern mining [20] and time series analysis [21]. Studies using stream data frequently developed a new association rule algorithm by handling the problem of the time window. The sliding window is a broadly applied approach for data stream mining thanks to its emphasis on recent data and its bounded memory requirements. A transactional sliding window aims to retain a fixed-size window over a data stream [22,23,24,25]. Web log data are another popular data type used in association rule mining. Web log data consist of a series of events where each record describes a session with particular page navigation [26,27,28,29].

Various researchers extended traditional association rule mining algorithms with fuzzy theory [26,30,31]. A fuzzy set can compensate for some of the limitations of the association rule methods. This research proposes a generic model to find association rules by fuzzifying transactions, called fuzzy association rules. Computing the support and accuracy of fuzzy association rules is the main difficulty [8,32,33,34,35]. Ref. [36] developed a recommendation system using hybrid fuzzy association rules to identify the significant user navigation pattern from the clustered frequent patterns in the tourism sector via a questionnaire. Ref. [31] proposed a fuzzy c-means clustering method to create association rules in combination with the Apriori algorithm. They focused on customer ratings instead of frequent item sets in the telecom area. In the media sector, Ref. [26] proposed a fuzzy inference system including a set of rules from the clustered pattern for identifying the significant user navigation pattern. Ref. [23] integrated fuzzy theory with data streams employing a sliding window approach to analyze association rules. In the e-commerce domain, visitors follow different navigation paths on the website and visit different pages in different order and frequency [37]. Ref. [38] designed a personalized recommendation system using fuzzy association rules in the e-commerce domain.

Some researchers utilized clustering approaches to create customer profiles and then created association rules for the clustered customers [31,39,40]. Clustering results were mainly used to provide personalized recommendations [41,42], optimize a website structure [43,44], and improve a customer-oriented strategy [45,46].

Most research in the literature focused on overcoming binary-valued transaction data [22,36,47,48]. Yet, transaction data in real-world cases mainly include fuzzy and quantitative values. Consequently, some fuzzy-oriented association rules algorithms were introduced [6,26,36,49,50]. Ref. [51] introduced a group recommendation system to achieve suitable membership functions and practical association rules from a database that includes uncertain data. The apriori algorithm was updated with fuzzy theory to get membership functions with more effective results. Ref. [52] used fuzzy association rules to design a recommendation system. The apriori algorithm was improved with discretization based on a clustering algorithm to express quantitative results in a nominal variable matrix. A fuzzy recommendation algorithm was proposed by combining quantitative association rules and fuzzy rules to predict the product that will be recommended.

Table 1. Previous studies on association rules.

Study	Data Generator	Method	Fuzzy	Focus	Domain	Explanation
[53]	Stream data	Novel	-	Time window	N/A	It introduced FP-stream, an effective FP-tree-based model for mining frequent patterns from data streams.
[47]	Stream data	Novel	-	Time window	E-commerce	It created all recent frequent patterns from a high-speed data stream over a sliding window.
[54]	Stream data	Novel	-	Time window	N/A	It enabled defining time windows’ number, size and weight.
[55]	Stream data	Novel	-	Time window	N/A	It presented a novel algorithm with normalized weight over data streams and tree structure that stores compressed crucial information about frequent item sets.
[24]	Stream data	Novel	-	Time window	E-commerce	It proposed a new algorithm which is suitable for observing recent changes in the set of frequent item sets over data streams.
[56]	Secondary	FS	+	Item set	Finance	It predicted the level of the stock market after the associations among different parameters are extracted
[48]	Web log	Novel	-	Time window	Education	It proposed the usage of a specific density-based algorithm for navigational pattern discovery.
[28]	Web log	FS	+	Item set	N/A	It used the 2-tuple linguistic description to create association rules at the intersection of fuzzy set boundaries.
[29]	Web log	FS	+	Item set	6 Domains	It discovered frequent fuzzy–probabilistic item sets and fuzzy association rules using a novel algorithm.
[38]	Secondary	FS	+	Item set	E-commerce	It designed a personalized recommendation system using fuzzy association rules.
[57]	Secondary	ARM	-	Item set	Retail	It calculated new support and confidence values based on “profit” to create interesting patterns.
[23]	Stream data	FS	+	Item set	Retail, Transportation	It integrated fuzzy theory with data streams, employing sliding window approach, to analyze association rules.
[22]	Stream data	Novel	-	Item set	Sport	It analyzes frequent patterns from real-time transactions with the sliding window technique.
[25]	Stream data	Novel	-	Item set	Retail	It proposed a new algorithm that focuses on keeping self-consistency of the discovered item sets.
[26]	Web log	FS	+	Item set	Media	A fuzzy inference system was generated, which includes a set of rules from the clustered pattern for identifying the significant user navigation pattern.
[50]	Secondary	CF	-	Item set	Education	It developed a recommendation system for students’ programming skills.
[36]	Surveyed	CF	-	Ratings	Tourism	A novel hybrid recommendation algorithm (HyRA) was introduced with point of interest and geographical information.
[31]	Web log	CF, FCM	+	Ratings	Telecom, e-commerce	It proposed the topological representation of tree-structured taxonomy and the statistical properties of the taxonomy.
[27]	Web log	CF	-	Ratings	Media	It described a new recommendation algorithm based on probability matrix factorization.
[58]	Secondary	FS	+	Item set	E-commerce	It also considers the sales amount to create association rules.

ARM: Association Rule Mining; CbF: Content Based Filtering, CF: Collaborative Filtering; FCM: Fuzzy c-means; FS: Fuzzy sets.

Previous studies focused on only a single $minsupp$. It means that the studies implicitly assume that all items in the database are similar. In other words, items have similar frequencies in the database, which is not valid in the real world. If using previous association rule mining algorithms, two problems will be encountered. First, some rules making few profits will be generated. Second, in the first iteration of the Apriori algorithm to yield a 1-item set, some items are deleted, which can make higher profits but have lower support. In terms of profit, even though the sale of some items has occurred only a few times (less than the predefined $minsupp$), they can be more important (e.g., much more expensive) than the others, which have occurred more frequently. For example, in a clothing store, a designer gown may have been sold only a few times but has a significantly higher value than the other clothing items sold more frequently. In this case, even though the designer gown may not have reached the predefined minimum number of sales, it would still be important to consider it for analysis as it can potentially contribute more to the store’s profit. Because of that, Ref. [2] focused on the multiple minimum supports to mine association rules considering the profit impact on the frequencies. They used a synthetic data set with 1000 items and 10,000 transactions. In the same focus, Ref. [57] proposed profit support and profit confidence by regarding the actual profit and averaging the total profit of each item. They used a sample data set with five transactions, including five products. Although [2,57] focused on profit-based association rules, this study improves their models under vagueness because transaction data in real-life mainly involve fuzzy and quantitative values. Moreover, this research uses real-world data to test the proposed methodology with 834,047 sales transactions, including 339 products generated by above 460,000 customers.

This study stands out from previous related works, which can be categorized into fuzzy association rules models and profit-support association rule models. It enhances fuzzy association rule models by incorporating profit considerations and expands the fuzzy theory to create profit-supported association rule models. The proposed method modifies classical support to include profit information, which determines the significance of items. The study introduces a novel model, P-FARM, for mining profit-supported fuzzy association rules in e-commerce to suggest more profitable products based on visitors’ interests.

3. Preliminaries

3.1. Association Rules

Let $I$ be a set of items and $T$ be a set of transactions with items in $I$. Both $I$ and $T$ are assumed to be finite sets. An association rule is defined as $X \to Y$, where $X$ and $Y$ are subsets of items ($X \subseteq I$ and $Y \subseteq I$), non-empty ($X \neq \emptyset$ and $Y \neq \emptyset$) and mutually exclusive ($X \cap Y = \emptyset$). So, the rule $X \to Y$ indicates that every transaction of $T$ that includes $X$ also includes $Y$.

Association rules are defined by two measures, support and confidence. A low support value causes two undesired results: generating rules by chance and defining uninteresting rules [59]. Moreover, the support threshold affects the creation of frequent item set generation. On the other hand, confidence validates the reliability of the judgment made by the rule. The confidence threshold affects the generated rules [60].

The support of an item set $I_{0} \subseteq I$ is the probability that a transaction of $T$ includes $I_{0}$ (Equation (1)).

$$s(I_{0}, T) = \frac{|\{\mu \in T \mid I_{0} \subseteq \mu\}|}{|T|} \tag{1}$$

where $I_{0} \subseteq I$, and $\mu$ is a transaction in $T$. The support and confidence of the association rule $X \to Y$ in $T$ are given in Equations (2) and (3), respectively. $S$ refers to the support value for rules, whereas $s$ refers to the support value for item sets. Association rule mining methods search for rules whose support and confidence values are greater than two thresholds, called $minsupp$ and $minconf$, respectively. These rules are regarded as strong rules, whereas rules with low support and high confidence are called interesting rules.

$$S(X \to Y, T) = s(X \cup Y, T) \tag{2}$$

$$c(X \to Y, T) = \frac{S(X \to Y, T)}{s(X, T)} = \frac{n(X \cup Y)}{n(X)} \tag{3}$$

where $n(\cdot)$ denotes the number of transactions in $T$ that include the given item set.
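As a minimal sketch of Equations (1) to (3), the snippet below computes support and confidence on a handful of crisp transactions. The function names and the toy transactions are illustrative assumptions and do not come from the case study.

```python
from typing import FrozenSet, Iterable, List


def count(itemset: Iterable[str], transactions: List[FrozenSet[str]]) -> int:
    """n(I0): number of transactions that contain every item of the item set."""
    wanted = frozenset(itemset)
    return sum(1 for t in transactions if wanted <= t)


def support(itemset: Iterable[str], transactions: List[FrozenSet[str]]) -> float:
    """Equation (1): share of transactions that include the item set."""
    return count(itemset, transactions) / len(transactions)


def confidence(x: Iterable[str], y: Iterable[str],
               transactions: List[FrozenSet[str]]) -> float:
    """Equation (3): n(X u Y) / n(X)."""
    both = frozenset(x) | frozenset(y)
    return count(both, transactions) / count(x, transactions)


# Hypothetical toy transactions, not taken from the paper's data.
transactions = [frozenset(t) for t in (
    {"A", "B", "C"}, {"A", "B"}, {"A", "C"}, {"B", "C"}, {"A", "B", "C"},
)]

print(support({"A", "B"}, transactions))       # 3 of 5 transactions -> 0.6
print(confidence({"A"}, {"B"}, transactions))  # 3 of the 4 transactions with A -> 0.75
```

A rule would then be kept as strong if both values exceed the chosen $minsupp$ and $minconf$.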

3.2. Fuzzy Association Rules

Fuzzy association rules are generated by fuzzy triangular numbers. A fuzzy number is a fuzzy set defined on the real numbers $\mathbb{R}$.

Definition 1.

A fuzzy number $F$ is a fuzzy set characterized by a membership function $\mu_{F}$.

$$\mu_{F} : \mathbb{R} \to [0, 1] \tag{4}$$

The triangular fuzzy number, a special type of fuzzy number, has a triangular membership function. It is represented by its low, medium and high points as $A = (L, M, H) = (a_{1}, a_{2}, a_{3})$. This representation corresponds to the membership function in Equation (5).

$$\mu_{A}(x) = \begin{cases} 0, & x < a_{1} \\ \frac{x - a_{1}}{a_{2} - a_{1}}, & a_{1} \leq x \leq a_{2} \\ \frac{a_{3} - x}{a_{3} - a_{2}}, & a_{2} \leq x \leq a_{3} \\ 0, & x > a_{3} \end{cases} \tag{5}$$
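The membership function of Equation (5) can be sketched in a few lines. The function name is an assumption made for illustration. The explicit check for the peak is an added convention so that "shoulder" shapes with two coinciding points, such as (0, 0, 4), do not divide by zero.

```python
def triangular_membership(x: float, a1: float, a2: float, a3: float) -> float:
    """Membership degree of x in the triangular fuzzy number (a1, a2, a3)."""
    if x < a1 or x > a3:
        return 0.0
    if x == a2:
        # Peak of the triangle; also covers a1 == a2 or a2 == a3.
        return 1.0
    if x < a2:
        return (x - a1) / (a2 - a1)  # rising edge
    return (a3 - x) / (a3 - a2)      # falling edge


print(triangular_membership(2, 0, 4, 8))  # halfway up the rising edge -> 0.5
print(triangular_membership(1, 0, 0, 4))  # left-shoulder shape -> 0.75
```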

A transaction is a specific form of a fuzzy transaction, and an association rule is a specific form of a fuzzy association rule.

Definition 2.

A fuzzy transaction is a nonempty fuzzy subset $\tilde{\mu} \subseteq I$, where $\tilde{\mu}(i)$ is the membership degree of $i \in I$ in the fuzzy transaction $\tilde{\mu}$. $\tilde{\mu}(I_{0})$ is the inclusion degree of an item set $I_{0} \subseteq I$ in a fuzzy transaction $\tilde{\mu}$, as given in Equation (6).

$$\tilde{\mu}(I_{0}) = \min_{i \in I_{0}} \tilde{\mu}(i) \tag{6}$$

Definition 3.

A fuzzy association rule $X \to Y$ in $T$ holds if and only if the degree of inclusion of $Y$ is greater than or equal to that of $X$ for every fuzzy transaction $\tilde{\mu}$, as in Equation (7).

$$\tilde{\mu}(X) \leq \tilde{\mu}(Y) \quad \forall \tilde{\mu} \in T \tag{7}$$

where $I$ is a set of items and $T$ is a set of fuzzy transactions. $X$ and $Y$ are two crisp subsets of $I$ ($X \subseteq I$ and $Y \subseteq I$) that are non-empty ($X \neq \emptyset$ and $Y \neq \emptyset$) and mutually exclusive ($X \cap Y = \emptyset$).
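Definitions 2 and 3 can be illustrated with a short sketch. It assumes that a fuzzy transaction is stored as a dictionary from item to membership degree, with absent items having degree 0. The item names and degrees are hypothetical.

```python
from typing import Dict, Iterable, List

FuzzyTransaction = Dict[str, float]


def inclusion_degree(transaction: FuzzyTransaction, itemset: Iterable[str]) -> float:
    """Equation (6): minimum membership degree over the items of the item set."""
    return min(transaction.get(item, 0.0) for item in itemset)


def rule_holds(x: Iterable[str], y: Iterable[str],
               transactions: List[FuzzyTransaction]) -> bool:
    """Equation (7): inclusion of X never exceeds inclusion of Y."""
    x, y = list(x), list(y)
    return all(inclusion_degree(t, x) <= inclusion_degree(t, y) for t in transactions)


# Hypothetical fuzzy transactions.
fuzzy_transactions = [
    {"bread": 0.5, "milk": 0.75},
    {"bread": 0.25, "milk": 0.25, "eggs": 1.0},
]

print(inclusion_degree(fuzzy_transactions[0], ["bread", "milk"]))  # 0.5
print(rule_holds(["bread"], ["milk"], fuzzy_transactions))         # True
print(rule_holds(["milk"], ["bread"], fuzzy_transactions))         # False
```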

4. Proposed Methodology

The proposed method extracts hidden patterns from the e-commerce sales volume and profit, called Profit-Support Fuzzy Association Rule Mining (P-FARM). Figure 1 shows the proposed methodology. It consists of three stages: ETL (Extract, Transform, Load) process, data analysis, and rules.

Step 1: The formation of the database, the first stage, begins with data collection.

Step 2: The second step is transforming quantitative values into fuzzy numbers. The membership values of an item $i$ can be defined as a set $\mu_{i} = \{\mu_{iL}, \mu_{iM}, \mu_{iH}\}$. Equation (8) is used to transform crisp numbers into fuzzy numbers, where $x_{i}$ represents the sales amount of item $i$. The values in the membership set indicate fuzzy membership degrees for the low, medium and high classes, respectively.

$$\mu_{i} = \begin{cases} \left\{\frac{low_{upper} - x_{i}}{low_{upper}},\ 1 - \frac{low_{upper} - x_{i}}{low_{upper}},\ 0\right\}, & \text{if } x_{i} \leq low_{upper} \\ \left\{0,\ \frac{medium_{upper} - x_{i}}{medium_{upper}},\ 1 - \frac{medium_{upper} - x_{i}}{medium_{upper}}\right\}, & \text{if } low_{upper} < x_{i} \leq medium_{upper} \\ \{0, 0, 1\}, & \text{if } x_{i} > medium_{upper} \end{cases} \tag{8}$$
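A direct transcription of Equation (8) might look as follows. The function name is an assumption. The bounds 4 and 8 used in the example call are the upper points of the low and medium classes that appear in the illustrative example of Section 6.

```python
from typing import Tuple


def fuzzify(x: float, low_upper: float, medium_upper: float) -> Tuple[float, float, float]:
    """Equation (8): membership degrees (low, medium, high) of a sales amount x."""
    if x <= low_upper:
        low = (low_upper - x) / low_upper
        return (low, 1 - low, 0.0)
    if x <= medium_upper:
        medium = (medium_upper - x) / medium_upper
        return (0.0, medium, 1 - medium)
    return (0.0, 0.0, 1.0)


print(fuzzify(1, low_upper=4, medium_upper=8))  # (0.75, 0.25, 0.0)
print(fuzzify(9, low_upper=4, medium_upper=8))  # (0.0, 0.0, 1.0)
```

In every branch the three degrees sum to one, so each sold quantity is fully distributed over at most two adjacent classes.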

Step 3: The fuzzy profit support (FPS) values are obtained.

Let $Z = \{z_{1}, z_{2}, \dots, z_{n + m}\}$ be a set of variables and $c_{i, j}$ an arbitrary fuzzy set associated with attribute $z_{j}$ in $Z$. In Equation (9), $\langle z_{j} : c_{i, j} \rangle$ indicates a fuzzy item and $\langle Z : C \rangle$ refers to a fuzzy item set, where $C$ is the corresponding set of some fuzzy intervals.

$$\langle Z : C \rangle = [\langle z_{i1} : c_{i1, j} \rangle \cup \dots \cup \langle z_{iq} : c_{iq, j} \rangle], \quad q \leq n + m \tag{9}$$

The fuzzy item sets are used to determine the fuzzy support values. The tuple $t_{i}$ of the dataset includes the value $t_{i}(z_{j})$ for the attribute $z_{j}$. Hence, the fuzzy support values of $\langle Z : C \rangle$ are calculated by the minimum operator (Equation (10)) or the product operator (Equation (11)).

$$FS(Z : C) = \frac{\sum_{i = 1}^{I} \min_{\langle z_{j} : c_{i, j} \rangle \in \langle Z : C \rangle} t_{i}(z_{j})}{N} \tag{10}$$

$$FS(Z : C) = \frac{\sum_{i = 1}^{I} \prod_{\langle z_{j} : c_{i, j} \rangle \in \langle Z : C \rangle} t_{i}(z_{j})}{N} \tag{11}$$
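The two operators in Equations (10) and (11) can be compared with a small sketch. It assumes that each tuple is stored as a dictionary from a fuzzy item label (product ID plus class letter) to its membership degree, and that $N$ is the number of tuples. The membership values below are hypothetical.

```python
from math import prod
from typing import Dict, Iterable, List


def fuzzy_support(records: List[Dict[str, float]],
                  fuzzy_itemset: Iterable[str],
                  operator: str = "min") -> float:
    """Equations (10) and (11): average combined membership over all tuples."""
    if operator not in ("min", "product"):
        raise ValueError("operator must be 'min' or 'product'")
    combine = min if operator == "min" else prod
    items = list(fuzzy_itemset)
    total = sum(combine([r.get(item, 0.0) for item in items]) for r in records)
    return total / len(records)


# Hypothetical membership degrees for two fuzzy items.
records = [
    {"1065M": 0.5, "1099L": 0.75},
    {"1065M": 1.0, "1099L": 0.25},
    {"1065M": 0.25},
    {"1099L": 0.5},
]

print(fuzzy_support(records, ["1065M", "1099L"], "min"))      # (0.5 + 0.25) / 4 = 0.1875
print(fuzzy_support(records, ["1065M", "1099L"], "product"))  # (0.375 + 0.25) / 4 = 0.15625
```

Because degrees lie in [0, 1], the product of a tuple's degrees never exceeds their minimum, so the product operator yields a support that is at most the one obtained with the minimum operator.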

Step 4: The frequent item sets are found. A frequent item set can be described as a set with fuzzy support values higher than a user-defined minimum support threshold. An algorithm is necessary to reduce the number of possible item sets because of high numbers. This study applies the Apriori algorithm to create frequent item sets. It is a stepwise algorithm that commences with obtaining the frequent 1-item set and iteratively creates new candidates utilizing the frequent items discovered in the previous iteration [61].
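The level-wise search described in Step 4 can be sketched as below. This is a generic Apriori outline under stated assumptions, not the author's implementation: `support_fn` is a hypothetical callback that returns the support of a candidate item set, and an item set is kept when its support is at least `min_support`. The usage example uses crisp toy transactions for brevity.

```python
from itertools import combinations
from typing import Callable, Dict, FrozenSet, Iterable


def apriori(items: Iterable[str],
            support_fn: Callable[[FrozenSet[str]], float],
            min_support: float) -> Dict[FrozenSet[str], float]:
    """Return every frequent item set together with its support."""
    frequent: Dict[FrozenSet[str], float] = {}
    level = [frozenset([i]) for i in items]  # candidate 1-item sets
    k = 1
    while level:
        # Keep the candidates of size k that reach the threshold.
        kept = {}
        for candidate in level:
            s = support_fn(candidate)
            if s >= min_support:
                kept[candidate] = s
        frequent.update(kept)
        k += 1
        # Join step: merge frequent sets that differ in exactly one item.
        joined = {a | b for a, b in combinations(kept, 2) if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must be frequent.
        level = [c for c in joined
                 if all(frozenset(sub) in kept for sub in combinations(c, k - 1))]
    return frequent


# Hypothetical crisp transactions used only to exercise the search.
transactions = [{"A", "B", "C"}, {"A", "B"}, {"A", "C"}, {"B", "C"}, {"A", "B", "C"}]


def crisp_support(itemset: FrozenSet[str]) -> float:
    return sum(1 for t in transactions if itemset <= t) / len(transactions)


result = apriori("ABC", crisp_support, min_support=0.5)
print(sorted("".join(sorted(s)) for s in result))  # ['A', 'AB', 'AC', 'B', 'BC', 'C']
```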

The Potential Profit Coefficient (PPC) must be computed by Equation (12) to incorporate the profit parameter into the fuzzy support. It is the ratio of the total profit of item $i$ to the average profit of all items.

$$PPC_{i} = \frac{P_{i}}{\bar{P}} = \frac{\sum_{t = 1}^{T} P_{it}}{\frac{\sum_{i = 1}^{I} \sum_{t = 1}^{T} P_{it}}{N}} \tag{12}$$

FPS values are obtained by multiplying the fuzzy support value by the potential profit coefficient, as given in Equation (13). FPS values are used to decide frequent item sets.

$$FPS = {FS(Z : C)}_{i} \times PPC_{i} \tag{13}$$
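Equations (12) and (13) reduce to two small functions. In this sketch the divisor $N$ of the average profit is passed in explicitly as `n`, because its exact definition is left to the reader here. The function names and the profit figures are hypothetical.

```python
from typing import Dict


def potential_profit_coefficients(profit_by_item: Dict[str, float], n: int) -> Dict[str, float]:
    """Equation (12): total profit of each item divided by the average profit."""
    average_profit = sum(profit_by_item.values()) / n
    return {item: profit / average_profit for item, profit in profit_by_item.items()}


def fuzzy_profit_support(fuzzy_support: float, ppc: float) -> float:
    """Equation (13): fuzzy support weighted by the potential profit coefficient."""
    return fuzzy_support * ppc


# Hypothetical total profits per item; n is assumed to be the number of items.
profits = {"P1": 300.0, "P2": 100.0, "P3": 200.0}
ppc = potential_profit_coefficients(profits, n=3)

print(ppc)                                   # {'P1': 1.5, 'P2': 0.5, 'P3': 1.0}
print(fuzzy_profit_support(0.25, ppc["P1"]))  # 0.375
```

An item with above-average profit (PPC greater than 1) has its support scaled up, so it can pass the threshold even when it sells less often.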

Step 5: All possible combinations of frequent item sets are considered to calculate the fuzzy confidence values using Equation (14) before producing fuzzy association rules.

$$FC(\langle X : A \rangle \Rightarrow \langle Y : B \rangle) = \frac{FS(\langle X : A \rangle \cup \langle Y : B \rangle)}{FS(\langle X : A \rangle)} \tag{14}$$
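The fuzzy confidence of Equation (14) can be sketched as follows, assuming the minimum operator for fuzzy support and tuples stored as dictionaries of membership degrees. The labels reuse the style of the case study, but the degrees are hypothetical; the 0.45 threshold is the one chosen by the managers in the case study.

```python
from typing import Dict, Iterable, List


def fuzzy_support(records: List[Dict[str, float]], itemset: Iterable[str]) -> float:
    """Fuzzy support with the minimum operator."""
    items = list(itemset)
    return sum(min(r.get(i, 0.0) for i in items) for r in records) / len(records)


def fuzzy_confidence(records: List[Dict[str, float]],
                     antecedent: Iterable[str],
                     consequent: Iterable[str]) -> float:
    """Equation (14): FS(X u Y) / FS(X); returns 0 when X never occurs."""
    x = set(antecedent)
    fs_x = fuzzy_support(records, x)
    if fs_x == 0:
        return 0.0
    return fuzzy_support(records, x | set(consequent)) / fs_x


# Hypothetical membership degrees.
records = [
    {"1165M": 0.5, "1164M": 0.75, "1127L": 0.5},
    {"1165M": 1.0, "1164M": 0.5, "1127L": 0.25},
    {"1165M": 0.25, "1164M": 0.25},
    {"1127L": 1.0},
]

fc = fuzzy_confidence(records, ["1165M", "1164M"], ["1127L"])
print(round(fc, 2))  # 0.1875 / 0.3125 -> 0.6
print(fc > 0.45)     # rule passes the confidence threshold -> True
```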

Step 6: The candidate item sets with higher confidence values than the threshold are put into the association rule repository.

Step 7: The information discovered from the association rules at the end of the six steps can be used to develop an e-commerce company’s profitability by offering more appropriate products.

Discovering fuzzy association rules is a process of obtaining the antecedents and consequents of a frequent item set. A rule is stated as “if $\langle X : A \rangle$ then $\langle Y : B \rangle$”. A fuzzy association rule is significant when its support and confidence values are higher than the predetermined thresholds.

This study does not apply the well-known Fuzzy Association Rule Mining (FARM) method. FARM is a data mining technique that uses fuzzy logic to extract association rules from data. In traditional association rule mining (ARM), items are considered either present or absent in a transaction. However, in FARM, the degree of membership of each item in a transaction is represented by a fuzzy set. It allows for more flexibility and expressiveness in modeling uncertain and imprecise data. The proposed methodology improves the traditional FARM technique by adding profit support values after converting them into fuzzy numbers. Different extensions are presented in the Proposed Methodology section step by step. As an example of these extensions, FPS in step 3 with Equation (9) and PPC with Equations (12) and (13) are calculated, which are ignored in FARM studies. The novelty of this study is to recommend products that are not only the most frequent and relevant but also the most profitable.

5. Case Study: E-Commerce Sales Based on Profit

5.1. ETL Process of E-Commerce Sales

A real-world case from the e-commerce domain is presented to demonstrate the usefulness of the proposed model, which creates association rules from sales data with profit information. To explain the mechanism of the proposed model, the case used over one million sales transactions from an international e-commerce company, covering 339 different products and generated by over 460,000 customers. After the data preparation steps, 834,047 sales transactions remained for analysis.

The first stage of the P-FARM commences with data preparation actions. Transactions that have only one sort of product were neglected as this research examines associations among products. Each transaction involves at least two types of products. The proposed P-FARM approach considers the profit of each item and the sales amount. A typical FARM method includes only sales amounts, and a traditional ARM method considers binary variables, sold or not.

The only attribute used in this case is the sales amount. However, profit was also calculated because this study focuses on developing profit-support association rules. In the case study, managers of the e-commerce company decided on the minimum support and confidence thresholds, which are 500 items and 0.45, respectively, for the sales volume.

5.2. Data Analysis

In the second stage, firstly, fuzzy support values were computed for the 1-item set of products and given in Table 2. The minimum FPS value was determined by managers as 500 items in all transactions. This means that a product is assigned to the frequent 1-item set if the maximum TFS of the product is larger than the minimum FPS value. 47 out of 399 products were found frequent when considering fuzzy classes, including sales amounts. When the minimum FPS increases, frequent item sets include fewer products. Interestingly, products in the “High” fuzzy class are absent in the 1-item frequent set. It means that the TFS of the “High” class never exceeded the minimum FPS in all transactions when profit was considered.

According to the Apriori principle, the algorithm eliminates items with a lower support value than the minimum FPS from the frequent item set. The frequent 2-item set was created by computing all possible combinations of the frequent 1-item set using the Apriori algorithm. Table 3 shows the fuzzy support values of the 2-item set. Twenty different products from the 1-item frequent set showed an important combination concerning the minimum FPS. Products 1065M, 1180M and 1099L are the most critical products because they were frequently purchased together with other products. The combination of products 1065M and 1099L has the highest FPS. Selling these two products in the “Medium” and “Low” fuzzy classes, respectively, results in a high profit for the company.

The algorithm stopped because none of the 2-item sets could meet the minimum FPS threshold.

5.3. Rules

The third stage discovers and stores profit-based fuzzy association rules. All potential association rules for each frequent itemsets were extracted, and the minimum fuzzy confidence threshold (F-conf), 45%, was checked for each discovered rule. Table 4 presents the fuzzy association rules and the corresponding fuzzy confidence values.

The rule “If {1165M, 1164M} then {1127L}” indicates that if products 1165 and 1164 are sold together in the “Medium” fuzzy class (up to eight items), then product 1127 is sold in the “Low” fuzzy class. It means that the company should recommend up to four items of product 1127 to the customer to gain a higher profit. These results confirm that the proposed P-FARM approach produces a great deal of information about e-commerce sales for decision-makers.

6. Experimental Comparison and Discussion

Table 5 shows a small part of sales transactions for experimental comparison of three different methods, ARM, P-ARM, and P-FARM. It includes a total of 17 transactions and shows the corresponding product IDs, quantities sold, unit profit, and total profit for each transaction. Total profit was calculated by multiplying the quantity and unit profit.

Table 6 presents an evaluation of the P-ARM approach created by reformatting Table 5. It summarizes the products sold in each transaction and identifies products above the minimum threshold, indicated by a green background. It shows each product’s Count, Quantity, and Total Profit, which were obtained from Table 5. Each product’s Potential Profit Coefficient (PPC) was computed using Equation (12). For example, the PPC of product 1002 is calculated by dividing the Total Profit obtained from product 1002 by the average profit per transaction over the five transactions: $\mathrm{PPC}_{1002} = (140 + 200) / (1543 / 5) = 1.10$. Table 6 also presents the Profit Support derived from the P-ARM approach for each product. Profit Support considers both the classical support and profit information and is used to identify important items for association rule mining.
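The PPC and Profit Support rows of Table 6 can be recomputed from the Table 5 transactions with a short script. This is a sketch under one stated assumption: Profit Support is taken to be Count multiplied by PPC, which is inferred from the values in Table 6 rather than quoted from the paper's equations. The function and variable names are illustrative.

```python
from collections import defaultdict

# (transaction, product, quantity, unit_profit) rows copied from Table 5
ROWS = [
    (275, 1003, 12, 7), (275, 1093, 8, 12), (275, 1095, 12, 6),
    (401, 1003, 10, 7), (401, 1080, 3, 4), (401, 1093, 11, 12), (401, 1095, 5, 6),
    (407, 1002, 7, 20), (407, 1003, 7, 7), (407, 1080, 1, 4),
    (407, 1093, 8, 12), (407, 1095, 1, 6),
    (521, 1002, 10, 20), (521, 1003, 6, 7), (521, 1092, 8, 27),
    (558, 1080, 6, 4), (558, 1092, 10, 27),
]

def profit_support(rows):
    """Return {product: (count, ppc, profit_support)} for a list of sales rows."""
    count = defaultdict(int)
    profit = defaultdict(float)
    transactions = set()
    for tid, product, qty, unit_profit in rows:
        transactions.add(tid)
        count[product] += 1                    # one row per product per transaction
        profit[product] += qty * unit_profit   # total profit of the product
    avg_profit = sum(profit.values()) / len(transactions)  # 1543 / 5 for Table 5
    return {
        p: (count[p], profit[p] / avg_profit, count[p] * profit[p] / avg_profit)
        for p in count
    }

stats = profit_support(ROWS)
# Assumed rule: a product is frequent when its Profit Support reaches minsupp = 3
frequent_parm = sorted(p for p, (_, _, ps) in stats.items() if ps >= 3)
```

With the Table 5 data, the products that reach the threshold are 1003, 1092 and 1093, which matches the P-ARM column of Table 8.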

Table 7 shows the fuzzy transformation of Table 6 using a fuzzy set consisting of the triangular fuzzy numbers $L = (0, 0, 4)$, $M = (0, 4, 8)$, $H = (4, 8, 8)$. For example, one unit of product 1080 was sold in transaction 407. According to the predefined fuzzy set, a sales volume of ‘1’ can belong to the ‘Low’ and ‘Medium’ sales sets. The crisp number ‘1’ is transformed into the fuzzy number $\mu_{1} = (0.75, 0.25, 0.00)$ using Equation (8). Similarly, all sales amounts in the dataset were converted into the corresponding fuzzy numbers. The membership values of each class were summed to determine the fuzzy class of an item. The class with the maximum sum defines the Final Fuzzy Class (FFC), and that sum gives the Fuzzy Count. Product 1002 was renamed ‘1002H’ because the maximum Fuzzy Count belonged to the ‘High’ class with 1.75. Then, the Fuzzy Profit Support (FPS) was calculated by multiplying the Fuzzy Count and PPC to determine frequent item sets. Products over the support threshold were assigned to the frequent 1-item set.
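The fuzzification and the Fuzzy Profit Support calculation can be sketched as follows. The membership function is written directly from the triangular numbers L, M and H given above; treating quantities above eight as fully ‘High’ is an assumption inferred from Table 7, since Equation (8) is not reproduced in this section. Quantities and total profits are copied from Table 6, and all names are illustrative.

```python
def fuzzify(qty: float) -> tuple:
    """Membership of a sales quantity in (Low, Medium, High)."""
    q = min(max(qty, 0.0), 8.0)   # assumption: quantities above 8 count as fully High
    low = max(0.0, (4.0 - q) / 4.0)
    medium = q / 4.0 if q <= 4.0 else (8.0 - q) / 4.0
    high = max(0.0, (q - 4.0) / 4.0)
    return (low, medium, high)

def fuzzy_class(quantities):
    """Sum memberships over transactions; return (class label, Fuzzy Count)."""
    totals = [sum(col) for col in zip(*(fuzzify(q) for q in quantities))]
    best = max(range(3), key=lambda i: totals[i])
    return "LMH"[best], totals[best]

# Per-product quantities and total profits copied from Table 6
QUANTITIES = {1002: [7, 10], 1003: [12, 10, 7, 6], 1080: [3, 1, 6],
              1092: [8, 10], 1093: [8, 11, 8], 1095: [12, 5, 1]}
TOTAL_PROFIT = {1002: 340, 1003: 245, 1080: 40, 1092: 486, 1093: 324, 1095: 108}
AVG_PROFIT = 1543 / 5   # average profit per transaction

fps = {}
for product, quantities in QUANTITIES.items():
    label, fuzzy_count = fuzzy_class(quantities)
    ppc = TOTAL_PROFIT[product] / AVG_PROFIT
    fps[f"{product}{label}"] = fuzzy_count * ppc   # FPS = Fuzzy Count x PPC

frequent_pfarm = sorted(name for name, value in fps.items() if value >= 3)
```

With minsupp = 3, only 1092H and 1093H remain, while 1003H falls just short because part of its membership lies in the ‘Medium’ class.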

According to the traditional ARM (Count row) in Table 6, all products apart from 1002 and 1092 are frequent because their counts reach the minimum support value, 3. On the other hand, because P-ARM changes the frequencies, some crucial changes occurred in the illustrative example. Whereas products 1080 and 1095 are in the ARM frequent 1-itemset, they were left out of the P-ARM’s frequent 1-itemset. Conversely, product 1092, which is infrequent under ARM, reached the threshold under P-ARM (Table 8). The profit support measure considers both the classical support and profit information and provides a better assessment of the importance of products. The changes in the Count and Profit Support values can impact the results of the association rule mining. In this case, the products above the threshold are critical because they contribute significantly to the frequent item sets and the derived association rules. Hence, the effect of the threshold value will be discussed.

Table 8 gives the details of the comparison. In the illustrative example, the ARM, P-ARM and P-FARM methods recommended 4, 3 and 2 products, respectively. The average profits of the products recommended by ARM, P-ARM and P-FARM are 7.25, 15.33, and 19.5 units of currency, respectively.
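The averages in Table 8 can be checked with a few lines of code. The sketch below copies the Count, Profit Support and Fuzzy Profit Support rows from Tables 6 and 7 and assumes that a product is recommended when its score reaches the minimum support; the helper name `average_profit` is illustrative.

```python
UNIT_PROFIT = {1002: 20, 1003: 7, 1080: 4, 1092: 27, 1093: 12, 1095: 6}

# Per-product scores copied from Table 6 (Count, Profit Support) and Table 7 (FPS)
SCORES = {
    "ARM":    {1002: 2, 1003: 4, 1080: 3, 1092: 2, 1093: 3, 1095: 3},
    "P-ARM":  {1002: 2.20, 1003: 3.18, 1080: 0.39, 1092: 3.15, 1093: 3.15, 1095: 1.05},
    "P-FARM": {1002: 1.93, 1003: 2.58, 1080: 0.19, 1092: 3.15, 1093: 3.15, 1095: 0.44},
}

def average_profit(scores, min_supp):
    """Mean unit profit of products whose score reaches min_supp; None if none qualify."""
    chosen = [p for p, s in scores.items() if s >= min_supp]
    if not chosen:
        return None
    return sum(UNIT_PROFIT[p] for p in chosen) / len(chosen)

comparison = {name: average_profit(scores, 3) for name, scores in SCORES.items()}
```

Passing a different `min_supp` shows how the recommended set shrinks or grows with the threshold; a value of 5 leaves no product for ARM or P-FARM in this small example.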

The P-FARM algorithm improved the results obtained from the P-ARM approach. Specifically, it changed the frequency of certain products. In this illustrative example, products 1092 and 1093 were frequent under the P-FARM algorithm, while the other products were not. This can be attributed to the fact that the P-FARM algorithm considers both the items’ support values and their profit by applying profit support after converting the sales amounts into fuzzy numbers. P-FARM recommended products 1092 and 1093 in this illustrative example. Product 1093 was recommended by all three methods. Because of its high profit, product 1092 was on the recommended product list for P-ARM and P-FARM. The critical point is that product 1003 was recommended by ARM and P-ARM but was excluded from the recommendation list once the fuzziness of the sales amount and profitability was considered. P-ARM listed it because of its Profit Support. However, P-FARM dropped it when the Fuzzy Profit Support was calculated, which indicated that it had a low level of profitability. Because the sales amounts of product 1003 fall into different fuzzy classes, ‘Medium’ and ‘High’, its Fuzzy Count is lower than its crisp count. Overall, the P-FARM algorithm identified items that, although they did not have a high support value, had high profit values, which made them more critical than other items.

The number of recommended products depends on the minimum support threshold. Figure 2 depicts the effects of minimum support values on the methods. In every case, P-FARM is better than or at least as good as the others; that is, the average profit of the products recommended by P-FARM is at least as high as that of the products recommended by ARM and P-ARM. Because ARM and P-FARM were not applicable, the results obtained with a minsupp of five should be ignored.

7. Conclusions and Future Directions

Rapidly growing technologies in the e-commerce domain necessitate developing novel algorithms and methodologies to investigate customer data. Given customers’ varying demands and companies’ highly competitive environment, even small improvements obtained by advanced methods may present companies with ample opportunities compared with traditional methods. E-commerce companies need to understand the purposes of users’ visits to gain competitive advantages. One way to learn these purposes is to analyze the purchased products and discover associations among them. Traditional approaches such as Association Rule Mining (ARM) and Fuzzy Association Rule Mining (FARM) can produce some rules regarding customer transactions. Whereas ARM focuses on products purchased together, FARM improves ARM by considering purchased amounts. However, previous studies ignore profitability when creating association rules. This study proposes a novel approach called Profit-Support Fuzzy Association Rule Mining (P-FARM) to analyze customer transactions by considering company profit.

This study also compares ARM, P-ARM and the proposed P-FARM methods by a numerical illustration. This comparison indicates that the same products can be regarded as frequent in one technique and infrequent in another. Therefore the number of frequent items and consequently generated association rules are different in each technique. Adding extra inputs, such as sales amount and profit parameter, into the technique also decreases the support and confidence values. On the other hand, these additional inputs provide much information for decision-makers about recommending related products.

Whereas ARM generates higher support and confidence values, P-ARM produces lower values because of the profit input. Support and confidence values are lower in P-FARM because of fuzziness and the potential profit coefficient. Therefore, fewer rules are generated by the more advanced association rule mining methods. To compare the methods, the focus should be on the rules rather than on confidence values. Although the numbers of rules produced by ARM (84 rules), P-ARM (49 rules) and P-FARM (23 rules) vary, the rules carry more information in P-FARM because it gives details about profitability and sales volume. Compared with ARM, FARM provides information about sales volume instead of just “sold or not” (binary) information.

The study presents notable results about which products, when sold together, increase profitability the most. For example, the combination of products 1065M and 1099L has the highest fuzzy profit support (FPS). Selling these two products in the “Medium” and “Low” fuzzy classes, respectively, results in a high profit for the company. The company should recommend product 1127, up to four items, to customers who bought products 1165 and 1164 in the “Medium” fuzzy class for both. These three products are frequently bought items. Although recommending other relevant items is possible, this recommendation makes the company more profitable.

Further studies can add a “time” perspective to the proposed methodology. Customer demands may change over time. Hence, purchasing time can be considered to improve this research. Although the number of implementations for mining data streams is increasing, association rule mining on stream data that considers profitability is still lacking. Weblogs can be used for analysis instead of transaction data because transactions reflect only the behavior of customers who made a purchase. For e-commerce companies, it is critical to know the visit purposes of customers who visited the webpage and left without buying anything.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

ARM	Association Rule Mining
CbF	Content Based Filtering
CF	Collaborative Filtering
FARM	Fuzzy Association Rule Mining
FCM	Fuzzy c-means
FPS	Fuzzy Profit Support
FS	Fuzzy sets
P-ARM	Profit-Support Association Rule Mining
P-FARM	Profit-Support Fuzzy Association Rule Mining
PPC	Potential Profit Coefficient

Figure 1. Flowchart of the proposed model.

Figure 2. Changes in average profit by support threshold.

Table 2. Fuzzy profit-support (FPS) values of frequent 1-item set.

Product	Low	Medium	High	Product	Low	Medium	High
1127L	4182.87	3179.77	371.25	1090L	1613.67	608.09	147.23
1018L	4653.20	3620.04	1727.16	1267M	84.04	545.33	40.72
1065M	258.08	472.57	35.56	1187M	68.45	239.68	4.53
1109L	1737.11	767.38	14.80	1180M	346.00	450.12	35.20
1139M	967.94	1430.46	20.59	1182M	147.64	688.15	41.59
1207M	201.38	593.11	89.06	1089L	158.41	118.37	9.37
1164M	418.07	905.05	59.73	1253L	770.58	365.98	20.45
1116L	318.32	72.63	4.81	1228L	579.74	553.89	33.02
1057L	821.68	466.02	2.61	1154L	2775.97	861.83	16.03
1104M	1280.57	1483.02	160.63	1098L	660.90	336.06	2.69
1048L	417.64	77.04	8.34	1074M	299.30	426.56	47.30
1099L	1581.19	1338.73	27.85	1049M	236.41	892.35	17.36
1006L	908.90	395.69	156.24	1210M	187.77	1447.35	79.14
1150L	3581.62	1184.42	153.72	1064L	939.85	4.26	45.54
1199M	2321.34	2805.22	18.19	1014M	304.14	336.15	27.27
1152L	2935.45	2546.30	77.00	1122L	739.35	686.70	46.98
1286L	566.53	326.21	49.75	1003M	464.20	665.73	12.26
1131L	389.90	166.10	3.40	1083M	66.15	220.99	141.10
1163L	251.37	92.05	3.72	1165L	879.75	141.67	11.60
1171L	724.13	403.30	1.50	1053L	2221.59	2033.65	92.46
1252L	2225.26	114.44	15.05	1075L	1003.71	389.52	33.31
1159H	187.82	485.58	588.91	1031M	445.94	2055.33	104.72
1070L	822.55	20.37	11.06	1001M	191.91	633.75	36.06
1196L	3413.361	2414.201	80.016

Table 3. Fuzzy support values of frequent 2-item set.

Products	FPS	Products	FPS	Products	FPS
{1001M, 1150L}	467.69	{1099L, 1150M}	236.12	{1180M, 1099L}	194.07
{1003M, 1098L}	313.39	{1099L, 1159H}	238.71	{1180M, 1131L}	344.56
{1018L, 1070L}	313.68	{1104M, 1003M}	555.15	{1180M, 1182M}	314.22
{1065M, 1099L}	625.32	{1116L, 1127L}	406.36	{1196L, 1089L}	165.30
{1065M, 1104M}	298.52	{1122L, 1199M}	169.35	{1199M, 1049M}	176.34
{1065M, 1109L}	281.68	{1131L, 1127L}	319.21	{1210M, 1165L}	228.82
{1065M, 1164M}	407.92	{1165L, 1210M}	387.61

Table 4. Fuzzy association rules with 45% of fuzzy confidence value.

Fuzzy Rules	F-Conf	Fuzzy Rules	F-Conf
If {1065M, 1164M} then 1127L	0.64	If {1116L, 1127L} then 1003M	0.52
If {1131L, 1127L} then 1199M	0.63	If {1001M, 1150L} then 1018L	0.52
If {1099L, 1159H} then 1210M	0.62	If {1122L, 1199M} then 1001M	0.52
If {1116L, 1127L} then 1018L	0.60	If {1001M, 1150L} then 1003M	0.51
If {1003M, 1098L} then 1122L	0.60	If {1199M, 1049M} then 1159H	0.50
If {1065M, 1164M} then 1104M	0.60	If {1196L, 1089L} then 1065M	0.49
If {1104M, 1003M} then 1098L	0.59	If {1196L, 1089L} then 1159H	0.49
If {1003M, 1098L} then 1018L	0.59	If {1065M, 1164M} then 1098L	0.46
If {1196L, 1089L} then 1089L	0.56	If {1065M, 1099L} then 1098L	0.45
If {1180M, 1182M} then 1164M	0.56	If {1018L, 1070L} then 1049M	0.45
If {1001M, 1150L} then 1182M	0.55	If {1116L, 1127L} then 1099L	0.45
If {1122L, 1199M} then 1070L	0.53

Table 5. A part of sales transactions.

Transaction	Product	Quantity	Unit Profit	Total Profit
275	1003	12	7	84
275	1093	8	12	96
275	1095	12	6	72
401	1003	10	7	70
401	1080	3	4	12
401	1093	11	12	132
401	1095	5	6	30
407	1002	7	20	140
407	1003	7	7	49
407	1080	1	4	4
407	1093	8	12	96
407	1095	1	6	6
521	1002	10	20	200
521	1003	6	7	42
521	1092	8	27	216
558	1080	6	4	24
558	1092	10	27	270
Total		125	188	1543

Table 6. P-ARM for frequent 1-item set (minsupp = 3).

Transaction	Products
Transaction	1002	1003	1080	1092	1093	1095
275	0	12	0	0	8	12
401	0	10	3	0	11	5
407	7	7	1	0	8	1
521	10	6	0	8	0	0
558	0	0	6	10	0	0
Count	2	4	3	2	3	3
Quantity	17	35	10	18	27	18
Total Profit	340	245	40	486	324	108
PPC	1.10	0.79	0.13	1.57	1.05	0.35
Profit Support	2.20	3.18	0.39	3.15	3.15	1.05

Table 7. P-FARM profitability for frequent 1-item set (minsupp = 3).

Transaction	Products
Transaction	1002	1003	1080	1092	1093	1095
275		{0.00, 0.00, 1.00}			{0.00, 0.00, 1.00}	{0.00, 0.00, 1.00}
401		{0.00, 0.00, 1.00}	{0.25, 0.75, 0.00}		{0.00, 0.00, 1.00}	{0.00, 0.75, 0.25}
407	{0.00, 0.25, 0.75}	{0.00, 0.25, 0.75}	{0.75, 0.25, 0.00}		{0.00, 0.00, 1.00}	{0.75, 0.25, 0.00}
521	{0.00, 0.00, 1.00}	{0.00, 0.50, 0.50}		{0.00, 0.00, 1.00}
558			{0.00, 0.50, 0.50}	{0.00, 0.00, 1.00}
Total Fuzzy Score	{0.00, 0.25, 1.75}	{0.00, 0.75, 3.25}	{1.00, 1.50, 0.50}	{0.50, 0.00, 2.00}	{0.00, 0.00, 3.00}	{0.75, 1.00, 1.25}
Final Fuzzy Class	1002H	1003H	1080M	1092H	1093H	1095H
Fuzzy Count	1.75	3.25	1.50	2.00	3.00	1.25
PPC	1.10	0.79	0.13	1.57	1.05	0.35
Fuzzy Profit Support	1.93	2.58	0.19	3.15	3.15	0.44

Table 8. Numerical comparison of ARM, P-ARM and P-FARM.

Product	Unit Profit	ARM	P-ARM	P-FARM
1002	20
1003	7	1	1
1080	4	1
1092	27		1	1
1093	12	1	1	1
1095	6	1
Total Profit		29	46	39
Average Profit		7.25	15.33	19.5

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
