An optimal peer selection for peer-to-peer video content distribution using fuzzy linear programming approach

M. Anandaraj; Naganandhini S.

doi:10.7717/peerj-cs.2701

An optimal peer selection for peer-to-peer video content distribution using fuzzy linear programming approach

M. Anandaraj ¹, Naganandhini S.²

1Department of Information Technology, PSNA College of Engineering and Technology, Dindigul, Tamilnadu, India

2Department of Computer Science and Engineering, PSNA College of Engineering and Technology, Dindigul, Tamilnadu, India

DOI: 10.7717/peerj-cs.2701

Published: 2025-02-28
Accepted: 2025-01-22
Received: 2024-09-04

Academic Editor: Željko Stević

Subject Areas: Computer Networks and Communications, Optimization Theory and Computation, Theory and Formal Methods
Keywords: Content distribution network, Fuzzy set, Fuzzy linear programming, Peer selection problem, Peer to peer network

Copyright: © 2025 Anandaraj and S.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits using, remixing, and building upon the work non-commercially, as long as it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.

Cite this article: Anandaraj M, S. N. 2025. An optimal peer selection for peer-to-peer video content distribution using fuzzy linear programming approach. PeerJ Computer Science 11:e2701 https://doi.org/10.7717/peerj-cs.2701

The authors have chosen to make the review history of this article public.

Abstract

The problem of peer selection in peer-to-peer (P2P) video content distribution network is significant to solve since it affects the performance and efficiency of the network widely. In this article, a novel framework is introduced that uses fuzzy linear programming (FLP) to address the inherent uncertainties in peer selection. The primary motivation for the use of FLP lies in its capability to handle the imprecision and vagueness that are characteristic of dynamic P2P environments. Factors such as peer reliability, bandwidth, and proximity are often uncertain in this environment. By using fuzzy logic, the proposed framework models these criteria as fuzzy sets and then integrates uncertainty into the decision-making process. FLP is then applied to optimize peer selection, improving download speed, reducing download time, and enhancing peer reliability. The proposed method is evaluated and analyzed using extensive simulation with SciPy. The result reveals that proposed technique works better compared to some of the traditional methods in terms of download time, download speed and also reliability measure. It also exhibits approximately 20% of increase in download speed as well as a 15% decrease in download time compared to traditional approaches. It leads to faster content retrieval and enhanced the efficiency in content distribution. Also, in selection of reliable peers for content distribution, there is a notable 20% of increase in peer reliability with result of enhanced robustness. The proposed method provides efficient and robust solution to the problem of peer selection. It can be implemented in a broad range of P2P content distribution networks.

Introduction

Decentralized characteristics are represented in peer-to-peer (P2P) networks where nodes act as both consumers and suppliers of any resources including any format content (Gebraselase, Helvik & Jiang, 2022). Any files, resources and also services can be directly shared among the connected nodes in these types of networks. Lack of reliance on a central authority is one important characteristic which distinguishes P2P networks with others. Such decentralized system assures that any peer within them can share the resources and files independently, add computing power and assign bandwidth to others leading to effective utilization of resources available in the network. Additionally, P2P networks reveal self organizing behavior which allows for seamless entry and exit of any participants with no effect on the system’s performance as a whole at any time. This dynamic adaptability assures autonomy even when the size and structure of the network increases and changes (Budhkar & Tamarapalli, 2020). The addition of any new peers in P2P networks increases availability of resources and redistributes workload among network participants. Search and retrieval of resources in the case of structured P2P networks are eased by explicit topologies like ring or tree structures. In contrast, random connections among the peers or some heuristics are used to connect unstructured networks allowing more adaptability and flexibility in node to node connectivity (Hwang et al., 2022). The nature of decentralization of P2P network enables participating peers to independently share files, computing power, allocate bandwidth among them directly. For an example, file sharing protocols such as BitTorrent allow one to download files from multiple sources at the same time and improve distribution of large files (Budhkar & Tamarapalli, 2020).

In addition, P2P networks have resulted in significant changes in resource sharing, collaboration and communication through distributed computing projects like SETI@home, which utilize a lot of computers for scientific computations (Gebraselase, Helvik & Jiang, 2022). The decentralization of operations offered by P2P networks is very beneficial. It enables resources to be shared among different users. But, in relation to security, unstructured environments present the major challenges with respect to data integrity and network efficiency (Hwang et al., 2022). However, continuous technological advancements are further improving P2P networks while addressing weakness and expanding their applications into diverse domains (Ren et al., 2020). Figure 1 shows the system structure of this particular P2P network. Distributing content across the globe via servers is a key characteristic of content distribution networks (CDNs) (Nacakli & Tekalp, 2021). These are systems that cache and distribute content such as web pages, multimedia contents, software downloads among other digital assets. The main goal of CDNs is to enhance the efficiency of delivery of content, reducing delays and also optimizing user experience by minimizing the time it takes for users to obtain required services (Yao, Xiang & Liu, 2024). One remarkable feature of CDNs is their exploitation of caching methods over several different servers located in several locations. It makes the same piece of content to be duplicated and stored in several different locations by allowing end users to access them from a physically nearby location (Luo et al., 2020). By dispensing sources across servers, CDNs optimize server overall performance, prevent server congestion, and ensure consistent and reliable facts transport (Farahani et al., 2023). Peer-to-peer content delivery networks (P2P CDNs) combine the decentralized architecture of P2P networks with the content delivery capabilities of CDNs that enable these networks to share and distribute information between them without interdependence on centralized server infrastructure (Farahani et al., 2023; Han & Nam, 2019).

Figure 1: P2P network structure.

Download full-size image

DOI: 10.7717/peerj-cs.2701/fig-1

Research highlights and paper organization

The proposed system aims at improving and modernizing network performance and resource allocation within a P2P network. The enhancement of efficiency and effectiveness of peer selection methodologies within P2P networks is also aimed at this research by incorporating fuzzy linear programming. Adaptability and precision are introduced in the decision-making process by fuzzy logic, accommodating the essential uncertainties and inaccurate data that are prevalent in those networks. Latency reduction and load balancing among the participating peers are targeted by this approach, but it also exhibits a paradigm shift in the direction of optimized utilization of available resource, scalability, and dynamic adaptability within P2P architectures. The introduction of FLP in the process of peer selection is aimed not only at minimizing latency and also improving load distribution, as well as giving higher priority on impartial content sharing among the participating peers. It guarantees an improved user experience through increased reliability, throughput, and accessibility. A redefinition of the P2P CDN domain is assured by this research, since it introduces a more flexible and responsive peer selection framework, eventually setting the foundation for a more efficient and robust network infrastructure. The major objectives of this article are given as follow.

Design a comprehensive model combing fuzzy logic with linear programming techniques to optimize the selection of peers in P2P CDN.
Examine the applications of the fuzzy linear programming technique in handling dynamic network conditions and uncertainties for ensuring robust peer selection mechanisms.
Analyze the impact of fuzzy linear programming on resource allocation, intending to attain impartial content distribution among the participating peers at the same time maximizing throughput and reliability of the network.

The remaining part of the article is prepared as follows. “Related Works” explains the literature review of the proposed work and existing mechanisms as well as their limitations to resolve the problem of peer selection. “System Model” describes the basic idea behind implementation of fuzzy logic programming and its applications. “Proposed System” elaborates the proposed method, including the fuzzy logic programming technique and the peer selection framework. “Performance Comparison” and “Result and Discussion” states the results and experimental analysis and discussion of the proposed system respectively. “Conclusion” finally gives the conclusion of the article and outlines future directions of research.

Related works

Peer selection problem in P2P networks

The peer selection problem is a critical challenge in P2P networks, especially within content distribution networks (CDNs), where effective peer selection significantly impacts network performance, reliability, and user experience (Nie et al., 2023). Traditionally, peer selection algorithms prioritize criteria such as proximity, bandwidth, latency, and historical performance, aiming to optimize the efficiency of content delivery by matching peers based on their capabilities and network states (Safara, Souri & Deiman, 2020). In this selection problem, finding out the most opt peers to download and share content within a P2P content distribution network is a big challenge. Because selecting the wrong peers will lead to a down in network performance, download speed. Also, it increases network congestion. To solve the problem of peer selection, P2P CDN uses various techniques. Ali et al. (2020) explored AI techniques, specifically machine learning, which capitalizes on peer’s knowledge. Their objective is to adapt the selection process to meet peers’ requirements. The free-riding problem in file sharing prompts the use of incentive mechanisms, such as fairness between uploads and downloads in schemes like BitTorrent (Luo et al., 2020; D’Alessandro Costa & Gonçalves Rubinstein, 2019). Our system is not designed for file-sharing environments. Instead, our focus lies in selecting the most suitable peer for specific service requirements, prioritizing performance over fairness. Distinct from file-sharing applications that typically involve relatively short burst chunk downloads/uploads, our approach aims to select the best peer for long-term sessions. In the domain of P2P multimedia stream services, similar studies address peer selection issues, aiming to optimize peer selection concerning service performance (Meng, 2018; Geng & Fujita, 2024). Additionally, the prevalence of P2P traffic poses challenges for internet service providers (ISPs). Several proposals (Hwang et al., 2022; Nacakli & Tekalp, 2021; Geng & Fujita, 2024) suggest collaborations between ISPs and P2P applications to help ISPs make informed choices about the best peers within their domains, aiming to reduce costs. In contrast, the scheme proposed in BPSS (best peer selection scheme) (Xue et al., 2023) selects optimal peers for service interactions within the geographical domain of the service requester, eliminating the necessity for explicit collaboration between service providers. Izhak-Ratzin, Park & Schaar (2012) presented a strategy for achieving an optimal peer selection technique using Q-learning. In their methodology, a peer’s action involves selecting target peers from a pool of n adjacent peers. In contrast, the original BitTorrent protocol divides the upload capacity of a peer p into four upload slots. Three slots are allocated for uploads to the three fastest peers that received the most uploads from p in the previous phase, while one slot is designated for uploads to a randomly chosen peer from the entire set of peers. Even though this approach could attain a nearly optimal scheme for small sized network, it could not be used to larger sized network because the number of states increases exponentially to the number of neighboring peers. Another one common approach is to use a tracker server. It keeps track of which peers have the content and can provide information about their availability and speed. The client then use this information to choose the best set of peers to download the content from Ghasemkhani et al. (2018). One more common technique is to employ a distributed hash table (DHT), which is a distributed in nature that permits efficient lookup of information using key. In a P2P CDN, a DHT is used to store the information about which peers in the network have which portion of the content (Kumar & Pandey, 2022). Clients use the DHT to pick the best possible set of peers to approach for downloading the content. In addition to these systems, P2P CDNs make use of a variety of algorithms to optimize peer selection like the random peer selection algorithm. The Chord algorithm uses a consistent hashing function to route requests to the appropriate peer (Masood et al., 2018). Even though these techniques are functioning well and good, the peer selection problem of those techniques continue to be a challenging problem to solve in P2P CDNs. In particular, it underperforms in large scale networks with a high degree of churn, where peers often join and leave the network at any point of time. In order to solve these challenges, researchers continue to examine new algorithms and techniques to optimize the selection of peer in P2P CDNs (Surati, Jinwala & Garg, 2017; Anandaraj et al., 2021). There are some mechanisms is used to solve the above stated problem in P2P CDNs. The following are some of the conventional solutions used to solve the problem of peer selection.

Peer selection at random: It is one of the simple technique used to select the peers at random for content download. It does not consider any criteria like peer download speed, availability or any other constraints but can still be effective in some circumstances.

Centralized peer selection: There is a centralized server or tracker used to provide the assistance to the peers to pick other set of peers for collaboration to download the content in this approach. There is a server to store and maintain a list of available peers and selects the best possible peers based on criteria like download speed, availability and other desirable criteria. It may suffer from a single point of failure and also there is a possibility to increase in network latency.

Decentralized peer selection: Peers work together to choose the best peers for content distribution in this approach. Peers can exchange information among themselves about their capacities like download speed, upload capacity, and availability to find out the best peers to download content from. This approach is complex to implement and also may need some additional communication overhead.

The selection of the most suitable solution for the peer selection problem in P2P CDNs relies on the specific characteristics of the network, such as the number of peers, network topology, and also content popularity (Naganandhini & Shanthi, 2023; Ren et al., 2020). A combination of these approaches may also be employed to enhance network performance and reduce network congestion, but difficult to integrate all those techniques.

Fuzzy liner programming

Fuzzy linear programming (FLP) is one type of programming technique which makes use of fuzzy logic to form algorithms. It can use to make informed decisions with help of even using imprecise and uncertain data (Loganathan & Ganesan, 2021; Edalatpanah, 2023). It is also one kind of a mathematical technique that can be used to resolve this problem by integrating uncertainty and also imprecision into the process of making decision (Wang et al., 2023; Yang & Zheng, 2021). It permits to obtain decisions with best currently available data, even if the data is not complete. FLP is utilized to resolve complex problems which cannot be solved with help of conventional programming schemes. It can also be utilized to find optimal processes and systems (Yang, Zhou & Cao, 2015). Due to FLP’s capability of managing fuzzy constraints, it is a perfect solution for the real-world problems where data and constraints are dynamic and high degree of control is not achievable (Goguen & Zadeh, 1973). FLP can therefore adapt to the changes in resource availability, demand and other conditions in the network unlike the conventional LP models. It can provide feasible as well as efficient solution compared to conventional methods by combining fuzzy logic and fuzzy sets into the optimization problem (Nguyen et al., 2019). Also, FLP’s use of linguistic variables and partial satisfaction enable it to incorporate human-based evaluations that are typical in network management environments. FLP has been found to be very useful in applications that involve the use of flexible, dependable solutions in areas such as network design, resource management, and scheduling (Liu et al., 2016). In network design, FLP supports the optimization of several and frequently competing objectives such as bandwidth, delay and cost. FLP also allows for an adaptable design strategy because it considers each criterion as having varying degrees of importance or priority. It will be useful in responding to the variability of network loads. FLP is used to resolve using conventional algorithms for linear programming such as the simplex technique. Also, fuzzy logic operations are utilized to handle the fuzzy decision variables and constraints (Zhang et al., 2024). The result of FLP is a set of best possible values for the fuzzy decision variables. It can be utilized to find out the best set of possible peers to choose for content distribution.

System model

P2P network comprising N identical peers linked by an overlay network are considered, where each peer is connected to n other peers through random links. A file slated for download is fragmented into multiple chunks in advance and these pieces are dispersed across the network at random. Initially, every peer has an equal number of pieces. The primary aim of the peers is to gather all pieces by engaging in mutual exchange of piece among themselves. Specifically, any peer is termed as a seeder upon acquiring all pieces, while a peer still in need of a piece from others is labeled as a leecher. It is elaborated using a mathematical model in this section. Assume N denotes the total number of peers in the P2P network, and n represents the fixed number of connections and each peers has within the network. The file to be downloaded is divided into M pieces in advance, and each peer starts with an equal share of P pieces initially. Hence, P can be expressed as:

(1) $P = N / M .$

In this context, a seeder is a peer that has successfully accumulated all M pieces, while a leecher is a peer that still requires at least one piece to complete the entire file. The aim for each leecher is to acquire all the missing pieces. It can be done through peer-to-peer interactions. The overall objective across the network is for all peers to transition from leechers to seeders by effectively exchanging pieces among themselves until every peer has obtained the complete file. Each peer need to maintain an adjacency matrix A, where A_jk = 1 if there is a connection between peers j and k, and A_jk = 0 otherwise. The transfer rate between any two connected peers j and k is denoted by r_jk, representing the capacity of the link between them. Each leecher requests missing pieces from its connected seeders. The request is handled at random with one of the available seeders connected to the leecher. The successful probability of a piece of content transfer between a seeder and a leecher is represented by p_jk, where p_jk = 0 if there’s a unsuccessful transfer and p_jk = 1 otherwise.

The probability p_jk rely on various factors such as congestion of network, latency, and connection stability, among others at any given point of time. The expected time for a leecher to attain any missing piece from a seeder is represented using probability theory. Suppose T_jk represents the time taken for leecher j to download a piece from seeder k. Then the expected time E[T_jk] is formulated as given below:

(2) $E [T_{j k}] = 1 / r_{k j} \times p_{k j} .$

The expected time to be a seeder from leecher, for instance collection of all missing pieces, is influenced by the number of connected seeders, transfer rates, and the success probabilities of piece transfers at any time slot. A complete model considering these factors is able to predict the time taken for the entire network to evolve as seeders from leechers. Even though, this model’s complexity rises with the number of peers, connections, and other dynamics of network. The modeling and derivation is further extended by integrating dynamic nature of network, probabilistic rate of transfer, and interaction patterns among the peers to make a more comprehensive model capturing the evolution of seeders from leechers over the period of time in a P2P network for content dissemination. Consider a dynamic nature of P2P network where dynamics of network persuade the transfer rates r_jk and the success probabilities p_jk between peers. These dynamics may consist of varying capacities of link, connectivity changes in peer, fluctuations in network congestions, and also time varying latency. The rate of transmission r_jk among peers j and k is affected by factors such as availability of bandwidth, network congestion and also conditions at the time of transfer. A dynamic model for r_jk is given as a function of time, represented as below:

(3) $r_{j k} (t) = f_{d y n a m i c s} (t) .$

Correspondingly, the probability of success p_jk for piece transfer is defined as probabilistic because of fluctuations in network conditions and behavioral change in peer. This probability may vary over time appropriate to factors like network stability, packet loss, and availability of peer:

(4) $p_{j k} (t) = g_{d y n a m i c s} (t) .$

Also, the interaction patterns among the peers influences the process of peer selection and as a result persuade the time for leechers to accomplish download missing pieces. This interaction pattern entails preferences among peers like preference on high-speed connections or favoring peers with the past successful transfers. A comprehensive model which captures the progression of leechers to seeders over the period of time in this dynamic network would involve time-dependent functions to characterize the changing rate of transfer and probabilities of success. The Monte Carlo simulation modeling technique is used to simulate the interactions among peers, the distribution of file, and the evolution of peers from leechers to seeders. By integrating dynamic nature of network, probabilistic transfer rates, and interaction patterns among peers, such a model envisage the time evolution of content distribution. It consider network variations and behavior change of peer over period of time. The complexity of this model would be high due to the dynamic nature of the network and they need to account for various network conditions and interactions among the peers. Suppose L_j(t) denotes a number of missing pieces in peer j and it needs to become a seeder at time t, and S_j(t) as the number of pieces peer j has already collected. The rate at which peer j attains missing pieces is given by the sum of successful transfer of pieces from other peer k to peer j at time t, is given as follow.

(5) $\frac{d L_{j} (t)}{d t} = \sum_{k \neq j} r_{j k} (t) X p_{j k} (t) X (L_{k} (t) - S_{k} (t)) .$

This equation signifies the rate of change in the number of missing pieces for peer j relies on the successful transfers from other peer k to j, assuming the missing pieces at k and the pieces j has already received. Figure 2 shows the system model.

Figure 2: System model.

Download full-size image

DOI: 10.7717/peerj-cs.2701/fig-2

Proposed system

Our method addresses the complexity and unpredictability of network conditions in by integrating fuzzy logic into the decision-making process. Peer characteristics in traditional P2P content sharing include reliability, latency, and bandwidth, and these changes over time. These are making the process of consistent and effective peer selection more challenging. By incorporating fuzzy logic, it is possible to model these uncertainties and make the decisions with regard to network conditions changeable. This methodology is more practical and realistic compared to the conventional deterministic approaches. The FLP technique involve to formulate a mathematical model which captures the relationship among the decision variables for peer selection and the objective function includes download speed and other set of parameters. The FLP technique involves formulating a mathematical model which captures the relationship among the decision variables for peer selection and the objective function includes download speed and other set of parameters. The model also includes constraints which limit the feasible and optimal values of the decision variables. In order to obtain a solution to the FLP model, fuzzy logic is used to characterize the vagueness and imprecision associated with the data. The model is then resolved using linear programming techniques such as the simplex technique to obtain the optimal solution. It minimizes the objective function with subject to the some of the chosen constraints.

FLP model

The proposed peer selection method is formed using mathematical model by modifying the standard model to combine fuzzy sets as well as fuzzy arithmetic. The model consists of the following important elements:

Peers or nodes: Let P = {p₁, p₂, …, p_n} used to indicate the set of available peers in the network at any point of time. Each peer p_i contains a local cache of video content C(p_i) and has some certain speed of downloading the video content v(p_i).

Content: Let C = {c₁, c₂, …, c_m} used to represent the set of video content in the network to be downloaded. Each piece of video content c_i is segmented into k chunks and located on several peers in the network. The availability of chunk j of video content c_i on peer p_i is used to denote a fuzzy variable A(p_i, c_i, j) which captures fuzzy values representing the degree of availability.

Network topology: Let G = (P, E) used to represent the graph containing the connections among the available peers in the network. Each edge e in E used to denote a direct link among any two peers.

Peer selection algorithm: Let S(p_i, c_i) are the fuzzy set of available candidate peers from which peer p_i can download chunk c_i of video content. The suitability of peer p_i for downloading chunk c_i is used to symbolize the fuzzy variable S(p_i, c_i). It takes fuzzy values denoting the degree of suitability.

Optimization objective: The objective of the proposed model is to minimize the time to attain total download for a given set of video content. Let T(p_i, c_i, j) used to denote the time it takes for peer p_i to download chunk j of video content c_i. The total time needed to download entire content can be formulated using fuzzy arithmetic as given below.

(6) $M i n i m i z e μ (\sum P_{i} \sum c_{i} \sum j A (p_{i}, c_{i}, j) * T (p_{i}, c_{i}, j))$

Subject to:

➢

A(p_i, c_i, j) ∈ F: It represents the availability of chunk j of video content c_i on peer p_i, where F is set of fuzzy sets.
➢

Σ p_i A(p_i, c_i, j) ≥ r(c_i, j): As a minimum r(c_i, j) peers need to have chunk j of video content c_i for the video content to be considered as available with help of fuzzy IF-THEN rules.
➢

T(p_i, c_i, j) = k x s(c_i)/v(p_i): it is used to represent the time needed for any peer p_i to download chunk j of video content c_i, where k symbolizes the size of the chunk, and s(c_i) is the size of video content c_i.
➢

S(p_i, c_i) ∈ F: The suitability of peer p_i for downloading chunk c_i of video content, where F denotes the set of fuzzy sets.
➢

Σ p_i Σ c_i A(p_i, c_i, j) x s(c_i) ≤ d(p_i): The total amount of video content downloaded by any peer p_i cannot go beyond its download limit d(p_i).
➢

Σ p_i Σ c_i A(p_i, c_i, j) ≥ b(c_i, j): At least b(c_i, j) chunks of video content c_i must be downloaded by set of peers to ensure no redundancy and some reliability, using fuzzy IF-THEN rules.

The fuzzy sets A(p_i, c_i, j) and S(p_i, c_i) is defined using linguistic variables such as “high,” “medium,” and “low.” The fuzzy measure μ is selected based on the desired aggregation method, such as max-min or max-product. By including fuzzy sets and fuzzy arithmetic, FLP offers more flexible and robust technique for peer selection and better reflecting the real-world variability of network conditions and peer behavior.

Convert FLP to LP

The FLP problem for the peer selection is transformed into an equivalent LP problem using alpha-cuts. Alpha-cuts are used then to transform the fuzzy constraints and fuzzy decision variables into crisp constraints as well as variables which can be utilized in a standard LP problem (Zhang et al., 2024). The following algorithm is used to translate the FLP problem to an LP problem.

The above steps are iterated until the solution is satisfactory one. The alpha-cuts must be adjusted based on the degree of uncertainty and also imprecision in the fuzzy decision variables and constraints (Wang et al., 2023). The resulting LP problem offers a flexible and also efficient technique to the problem of peer selection. Figure 3 represents the steps involved in FLP.

Figure 3: FLP steps.

Download full-size image

DOI: 10.7717/peerj-cs.2701/fig-3

Alpha cut

In FLP for the problem of peer selection, alpha-cuts are used to translate the fuzzy constraints into crisp constraints. It can be resolved using standard LP algorithms. An alpha-cut is used to depict the level of certainty that describes a crisp set from a fuzzy set. It indicates the minimum degree of membership that an element must include in the fuzzy set need to be incorporated in the crisp set (Nasseri, Verdegay & Mahmoudi, 2021). For example, suppose fuzzy decision variable y that represents the degree to which a peer is selected and its membership function is defined as:

(7) $y = {0 / 1, 0.1 / 0.9, 0.2 / 0.8, . . ., 0.9 / 0.1, 1 / 0} .$

This membership function indicates that the membership degree of peer i being chosen can be in the range between 0 and 1, with values that are more likely to be chosen having a higher membership degree. A level of confidence known as the alpha level is chosen, and only those elements of the fuzzy set whose membership degree is at least equal to the alpha level are taken into consideration in order to convert this fuzzy constraint into a crisp constraint using alpha-cuts. For example, suppose an alpha level of 0.5 is chosen. Then the corresponding alpha-cut for the fuzzy decision variable y would be:

(8) $y_a l p h a = {0 / 1, 0 / 1, 0 / 1, . . ., 1 / 0, 1 / 0} .$

Algorithm:

Convert FLP to LP.

Consider fuzzy decision variables represent the suitability of peers S(p_i,c_i) and fuzzy constraints based on availability and other criteria.

- Input:

- - Fuzzy decision variables and also their membership functions

- - Fuzzy constraints based on criteria for selection

- - Alpha value for alpha-cuts (0 <= α <= 1)

- Output:

- - Corresponding Linear Programming Problem (LP)

Step 1: Describe Fuzzy Decision Variables and Membership Functions

Describe the fuzzy decision variables S(p_i,c_i) denoting the suitability of each peer p_i to download video content c_i. Consider the membership functions μS_i(s) for each fuzzy decision variable.

Step 2: Define Fuzzy Constraints

Fuzzy constraints based on the availability and other criteria are described. Assume fuzzy constraint A(p_i,c_i,j) describing the availability of chunk j of video content c_i on peer p_i with membership function μA_ij(a).

Step 3: Determine Alpha-Cuts

For each fuzzy decision variable as well as constraint, find out alpha-cuts using the specified alpha value in the range between 0 and 1 such as 0 ≤ α ≤ 1.

For fuzzy decision variable S_α(p_i,c_i) = {s | μS_i(s) ≥ α}

For fuzzy constraint A_α(p_i,c_i,j) = {a | μA_ij(a) ≥ α}

Step 4: Translate Fuzzy Decision Variables into Crisp Decision Variables

Find out crisp decision variables based on alpha-cuts using inverse membership functions given below

s = μS_i⁻¹(α)

Step 5: Translate Fuzzy Constraints to Crisp Constraints

Find the crisp constraints using alpha-level set and also membership functions. For fuzzy constraint A(p_i,c_i,j) with alpha-cut Aα(p_i,c_i,j):

a_min = min(Aα( p_i,c_i,j)) and a_max = max(Aα(p_i,c_i,j))

Step 6: Formulate Equivalent LP Problem

Devise the equivalent LP problem using crisp decision variables as well as constraints:

Minimize c^T⋅y

Subject to A_y ≤ b

s_minimum ≤ s ≤ s_maximum and a_minimum ≤ A(p_i,c_i,j) ≤ a_maximum

DOI: 10.7717/peerj-cs.2701/table-102

This alpha-cut states that the peer i is either completely chosen (membership degree is equal to 1) or completely not chosen (membership degree is equal 0) for the alpha level of 0.5. In other words, it offers a crisp set of peers that can be utilized in the crisp LP problem. By changing the alpha level from 0 to 1, family of alpha-cuts which offer different levels of certainty in the fuzzy decision variable y is attained. These alpha-cuts is then used to solve the FLP problem and reach a fuzzy solution. It denotes the degree to which each peer is chosen at different levels of certainty. Let us assume the availability constraint:

$\begin{aligned} \sum (m e m b e r s h i p d e g r e e o f p e e r i f o r a v a i l a b i l i t y) * (u p l o a d r a t e o f p e e r i) \\ > = (r e q u i r e d a v a i l a b i l i t y f o r t h e c o n t e n t) \end{aligned}$

Consider the membership function for availability is defined by a triangular fuzzy set with a peak at 500 KB/s, and a base extend from 300 to 700 KB/s. To convert this constraint into an LP constraint, alpha-cuts is utilized. For example, if an alpha value of 0.7 is chosen, then the corresponding interval for the availability constraint is 460, 540 KB/s. It means that for all peers whose upload rate drops within this interval, their membership degree for availability is at least 0.7. Similarly, the alpha-cuts are determined for the other fuzzy decision variables and constraints. These alpha-cuts are used to construct the equivalent LP problem which can be solved using standard LP techniques. The choice of alpha value relies on the degree of uncertainty and imprecision in the fuzzy decision variables and also constraints, and can be adjusted accordingly to attain a satisfactory solution.

Solve LP after convert from FLP

Obtain the LP problem for the FLP approach to the peer selection problem in P2P CDN involves finding the optimal combination of peers which are all maximize the efficiency of content distribution at the same time satisfying the selection criteria. The LP problem is devised by converting the fuzzy constraints and also decision variables into crisp constraints and variables using alpha-cuts, as described in previous section. The common steps for solving an LP problem are given below.

Devise the LP problem: Define the decision variables, objective function, and constraints for the problem in terms of linear expressions.
Translate the LP problem into standard form: Revise the LP problem so that all constraints are equations, all decision variables are non-negative, and the objective functions is to be maximized or minimized.
Use simplex algorithm: Employ the simplex algorithm to iteratively resolve the LP problem by moving along the edges of the viable region until the optimal solution is reached.
Infer the solution: Once the optimal solution is found, deduce the solution in terms of the decision variables and constraints, and determine if the solution suits the original problem statement.

The exact LP problem formulation and solution approach rely on the specific selection criteria and fuzzy decision variables and also constraints used in the FLP (Rivaz, Nasseri & Ziaseraji, 2020; Zhang, 2024). Once the LP problem is devised, it can be converted to standard form and resolved using the simplex algorithm to attain the optimal combination of peers which maximizes the efficiency of content distribution at the same time satisfying all the selection criteria.

Fuzzy decision variables

Optimization of peer selection in P2P CDN involves considering several factors for efficient distribution of content among the peers. Fuzzy variables are used to represent imprecise and uncertain aspects of these factors (Anandaraj et al., 2024). These fuzzy variables symbolize the variations and uncertainties related with peer characteristics. Fuzzy logic and fuzzy sets are used in those variables for more accurate modelling and optimizing the processes of peer selection. It is used to consider the vague and dynamic nature of network conditions and peer attributes. These variables are incorporated into an optimization model which permits for better decision making in selection of optimal peers for content distribution in P2P networks. The process of fuzzification is used to illustrate the crisp input variables, bandwidth (Bw), availability of content (Ca), and distance measure or proximity value (Dm), by using membership functions (MFs) and also linguistic terms. Then these variables are standardized within the ranges of [0 to 200], [0 to 2,000], and [0 to 70] correspondingly. The MFs selection is subjective by the linguistic terms with related to the output values. For instance, trapezoidal MFs are suitable and ideal when a range of values result in smallest number of changes, while triangular MFs are more suitable when there’s a gradual change. Non-overlapping MFs are selected for sensitivity to changes in input variables, contrasting with overlapping MFs; in particular, bandwidth is described by five MFs denoted as very low bandwidth (VLB), low bandwidth (LB), medium bandwidth (MB), high bandwidth (HB), and very high bandwidth (VHB). VLB and VHB are modelled using left-right open-sided trapezoidal MFs, while LB, MB, and HB are represented by triangular MFs as depicted in Fig. 4. content availability (CA) is characterized by five MFs with linguistic terms very low (VLA), low (LA), moderate (MA), high (HA), and very high (VHA). In this context, SD and LD are represented by left and right open-sided MFs respectively, while AD is modelled using trapezoidal MFs, demonstrated in Fig. 5. The distance measure (DM) employs five MFs denoted as very close (VL), close (L), moderate (M), far (F), and very far (VF). These MFs are illustrated in Fig. 6, using a combination of open-sided and trapezoidal shapes to represent different degrees of proximity or distance.

Figure 4: Fuzzified input variables-bandwidth availability.

Download full-size image

DOI: 10.7717/peerj-cs.2701/fig-4

Figure 5: Fuzzified input variables-content availability.

Download full-size image

DOI: 10.7717/peerj-cs.2701/fig-5

Figure 6: Fuzzified input variables-distance measure.

Download full-size image

DOI: 10.7717/peerj-cs.2701/fig-6

Calculate the optimal fuzzy decision variables

Calculating the optimal fuzzy decision variables for the FLP approach to the problem of peer selection in P2P CDN involves solving the LP problem and then applying alpha-cuts to convert the crisp decision variables back into fuzzy variables. An example of optimal fuzzy decisions variables calculation using the FLP approach for the problem of peer selection in P2P CDN is given below:

1.

Describe the membership functions for the fuzzy decision variables x_i, which represent the degree to which each peer i is chosen:

(9) $x_{i} = {0 / 1, 0.1 / 0.9, 0.2 / 0.8, . . ., 0.9 / 0.1, 1 / 0}$

2.

Alpha-cuts are used to translate the fuzzy constraints into crisp constraints:

For each alpha-cut level α:

Availability constraint is represented as ∑(degree of membership for peer i for availability at α) * (upload rate of peer i) * x_i >= (availability requirement for the content at α)

Proximity constraint is represented as ∑(degree of membership for peer i for proximity at α) * (upload rate of peer i) * x_i >= (proximity requirement for the content at α)

Reliability constraint: ∑(membership degree of peer i for reliability at α) * (upload rate of peer i) * x_i>= (required reliability for the content at α)

Bandwidth constraint: ∑(membership degree of peer i for bandwidth at α) * (upload rate of peer i) * x_i<= (total available bandwidth for the content at α)

3.

Solve the crisp LP problem to obtain the optimal values of the binary decision variables xi.

4.

Calculate the alpha cuts for the optimal fuzzy decision variables by evaluating the membership functions at the optimal values of xi. For example, if the optimal values of xi are [0, 0, 1, 1, 0, 1, 0], the alpha-cuts can be calculated as follows:

α = 0.1:xi = {0, 0, 1, 1, 0, 1, 0}
α = 0.2:xi = {0, 0, 1, 1, 0, 1, 0}
…
α = 0.9:xi = {0, 0, 1, 1, 0, 1, 0}
α = 1.0:xi = {0, 0, 1, 1, 0, 1, 0}

The resulting alpha-cuts represent the degree to which each peer i is selected at each level of certainty, ranging from 0.1 to 1.0. The formulation of LP problem and solution will rely on the exact selection criteria and fuzzy decision variables as well as constraints used in the FLP technique. The LP problem formulation for the problem of peer selection and alpha cuts for the fuzzy decision variables x_i is described as follow.

x_0 = [0, 0, 0.2, 0.8, 1, 1, 1]
x_1 = [0, 0.2, 0.8, 1, 1, 1, 1]
x_2 = [0, 0.1, 0.7, 0.9, 0.9, 0.9, 0.9]
x_3 = [0, 0, 0, 0.2, 0.6, 0.8, 1]
x_4 = [0, 0, 0, 0, 0.1, 0.7, 0.9]
x_5 = [0, 0, 0, 0, 0, 0.2, 0.8]
x_6 = [0, 0, 0, 0, 0, 0, 0.3]

Those alpha cuts specify the membership degree of each peer i with the relevant fuzzy decision variable x_i. The alpha cut for x_1 represents that peer1 has zero membership for being chosen when x_i contains low value and full membership when x_i contains high value. Using those alpha cuts, the LP problem can be resolved to reach the optimal crisp decision variables for the problem. Let us consider the optimal solution is specified below:

x_0* = 1
x_1* = 1
x_2* = 0.9
x_3* = 0.6
x_4* = 0.1
x_5* = 0.2
x_6* = 0

In order to obtain the feasible and optimal fuzzy decision variables, it is necessary to associate the alpha cuts to each crisp variable to attain a range of values. For instance, the alpha cut for x_1* reveals that peer1 has full membership for being chosen whenever x_i is high, and zero membership whenever x_i is low.

Proposed algorithm

Algorithm 1:

Peer selection algorithm.

Input:

Set of all peers in the P2P network (Peers)

Peer’s performance and reliability metrics (Performance, Reliability)

Fuzzy linear programming model parameters (Constraints, Weights)

Output:

Selected set of peers to distribute the video content (SelectedPeers)

Procedure PeerSelection(Peers, Performance, Reliability, Weights, Constraints)

Initialize an empty set SelectedPeers

Fuzzy linear programming

for each peer in Peers do:

Calculate the performance score (P_score) for the peer using the given metrics

Calculate the reliability score (R_score) for the peer using the given metrics

Apply fuzzy logic to calculate fuzzy performance score (Fuzzy_P_score)

Apply fuzzy logic to calculate fuzzy reliability score (Fuzzy_R_score)

Fuzzy linear programming optimization

f or each peer in Peers do:

Define the optimization problem using fuzzy linear programming model:

Maximize Σ_i (Weights[i] * Fuzzy_P_score[i]) + Σ_i (Weights[i] * Fuzzy_R_score[i])

Set the objective function based on the fuzzy performance and reliability scores

Add constraints based on the given constraints and weights:

Σ_i (Weights[i] * P_score[i]) ≥ Constraints[1]

Σ_i (Weights[i] * R_score[i]) ≥ Constraints[2]

Solve the fuzzy linear programming problem to reach the optimal solution

Selection of peers based on the optimal solution

for each peer in Peers do:

if the peer is selected based on the optimal solution then

Add the peer to the SelectedPeers set

return SelectedPeers

End Procedure

DOI: 10.7717/peerj-cs.2701/table-101

Pseudocode framework

Step 1: Initialize the system and define peer set

Input:

Set of peers, P = {P1, P2, …, Pn}

Parameters for each peer Pi:

- Bandwidth (Bw_i): Fuzzy value with membership function μ(Bw_i)

- Latency (Lat_i): Fuzzy value with membership function μ(Lat_i)

- Reliability (Rel_i): Fuzzy value with membership function μ(Rel_i)

Define acceptable thresholds for each parameter:

- Minimum acceptable bandwidth (Bw_min)

- Maximum acceptable latency (Lat_max)

- Minimum acceptable reliability (Rel_min)

Step 2: Define the fuzzy membership functions

For each parameter:

- Define membership function for bandwidth μ(Bw) with fuzzy thresholds.

- Define membership function for latency μ(Lat) with fuzzy thresholds.

- Define membership function for reliability μ(Rel) with fuzzy thresholds.

Step 3: Set up the Fuzzy Linear Programming (FLP) Model

Objective:

Maximize the overall score for peer selection, Score(Pi), defined as a weighted sum:

Score(Pi) = w1 * μ(Bw_i) + w2 * μ(Lat_i) + w3 * μ(Rel_i)

where w1, w2, w3, w4 are weights assigned to each parameter based on importance.

Constraints:

- Assure each selected peer meets the minimum requirements with fuzzy constraints:

μ(Bw_i) ≥ μ(Bw_min)

μ(Lat_i) ≤ μ(Lat_max)

μ(Rel_i) ≥ μ(Rel_min)

Step 4: Define Linear Constraints with Fuzzy Parameters

For each constraint, translate the fuzzy requirements to linear inequalities using α-cuts:

- Define α as the level of satisfaction such as α = 0.8.

- Convert fuzzy parameters to α-cut crisp intervals:

Bw_min_α = interval for bandwidth at level α

Lat_max_α = interval for latency at level α

Rel_min_α = interval for reliability at level α

- Update constraints as follows:

Bw_i ≥ Bw_min_α

Lat_i ≤ Lat_max_α

Rel_i ≥ Rel_min_α

Step 5: Solve the Fuzzy Linear Programming Model

Initialize:

- Selected peers list S = {}

- For each peer Pi in P:

- Find Score(Pi) based on fuzzy membership values and weights

- Calculate_alpha_cut(min, opt, max, alpha)

- If Pi meets all α-cut constraints:

- Add Pi to S if Score(Pi) is higher than threshold

Output:

- Selected peers S with maximum Score(Pi)

Step 6: Return Optimal Peer Set

Return the peer set S which maximizes the video content distribution quality.

Function calculate_alpha_cut(V_min, V_opt, V_max, alpha):

Calculate the lower bound of the α-cut interval

V_low = V_min + alpha * (V_opt - V_min)

Calculate the upper bound of the α-cut interval

V_high = V_max - alpha * (V_max - V_opt)

Return the interval [V_low, V_high]

Return (V_low, V_high)

End Function

Function Calculate_Score (Pi, Performance_Metrics, Reliability_Metrics, Weights)

Extract Raw Performance and Reliability Values for the Peer

Raw_Bandwidth = Performance_Metrics[Pi][“Bandwidth”]

Raw_Latency = Performance_Metrics[Pi][“Latency”]

Raw_Stability = Reliability_Metrics[Pi][“Stability”]

Raw_Availability = Reliability_Metrics[Pi][“Availability”]

Determine the Raw Performance Score (P_score) Based on Performance Metrics

Here, it is assumed that Performance is based on Bandwidth and Latency

Normalize these values if needed and combine them into a single score

Assuming higher Bandwidth and lower Latency are preferred:

P_score = (Weight_Bandwidth * Raw_Bandwidth) - (Weight_Latency * Raw_Latency)

Normalize P_score to a 0-1 scale if necessary, depending on range

P_score = normalize(P_score, min=0, max=100) # Example normalization function

Compute the Raw Reliability Score (R_score) Based on Reliability Metrics

Assuming Reliability is based on Stability and Availability

Higher Stability and Availability values are ideal.

R_score = (Weight_Stability * Raw_Stability) + (Weight_Availability * Raw_Availability)

Normalize R_score to a 0-1 scale if necessary

R_score = normalize(R_score, min=0, max=100)

Apply Fuzzy Membership Functions to Calculate Fuzzy Scores

The fuzzy membership functions (μ_Performance and μ_Reliability) determine the fuzzy satisfaction level

for each raw score. Membership functions is trapezoidal based on requirements.

Fuzzy_P_score = apply_fuzzy_membership(P_score, “Performance”)

Fuzzy_R_score = apply_fuzzy_membership(R_score, “Reliability”)

Determine the Weighted Score for Each Fuzzy Metric

weights for Performance and Reliability to compute the final weighted score.

Weighted_Fuzzy_P_score = Weights[“Performance”] * Fuzzy_P_score

Weighted_Fuzzy_R_score = Weights[“Reliability”] * Fuzzy_R_score

Combine Weighted Scores to Get the Final Score for Peer Pi

Score_Pi = Weighted_Fuzzy_P_score + Weighted_Fuzzy_R_score

Return Score_Pi

End Function

DOI: 10.7717/peerj-cs.2701/table-103

Performance comparison

Simulation using SciPy

SciPy is a popular open source library in Python programming language which is especially developed to offer robust and friendly numerical algorithms for a large number of scientific and mathematical computations. NumPy is the base from which SciPy is developed further to include higher level materials specialized to do scientific computations. These include optimization, integration, interpolation, eigenvalues, algebraic and differential equation solving and statistical analysis advanced modules. Another of SciPy’s main advantages is that it has a lot of reliable and specific data structures. For example, it supplies sparse matrices, which are useful for working large data sets containing many zeros and k-dimensional trees (kd-trees) to support the spatial search. These structures make SciPy particularly suited for applications in machine learning, data science, physics, and engineering. SciPy’s implementation wraps around low-level, highly optimized code written in languages like Fortran, C, and C++, delivering computational performance close to native code while retaining the flexibility of Python. This combination enables developer to do lots of numerical computation while at the same time to do it efficiently. As such, SciPy is used in academia, research, as well as in industries because fast and accurate scientific computing is critical.

Simulation settings

The performance evaluation of proposed system is done with help of implementing the system with SciPy. SciPy is an important module of Python environment that has specific application in scientific computing especially in the context of optimization process. It is suitable to simulate the problems related to selection of peers in P2P networks. In the distribution of video content, many peers are involved and they have different values for bandwidth, reliability, and latency, hence identifying the best set is a difficult task. The simulation is done to vary the nodes from 100 to 600 in the network. In terms of assessing the performance of the proposed work, parameters such as download time, download completion rate and overhead on the network have been employed. At the time of simulation, the majority of the links are defined to be an ADSL by fixing less upload bandwidth than download bandwidth. The simulation parameters are given in Table 1, and the utilization of uplink and downlink bandwidth is shown in Table 2. Download bandwidth and upload bandwidth is assumed to be in terms of Mbps. This is where a fuzzy linear programming approach comes in. Because of the vagueness associated with this peer attributes, fuzzy logic is used to map them to fuzzy sets that approximate the suitability of the peers to distribute content (Villacorta et al., 2017). The optimization problem is based on these scores, which indicate how well each peer in a network satisfies a particular requirement. This optimization challenge is then resolved using SciPy’s optimize module, most especially with the linprog function. While network limitations, such as the minimum bandwidth, the maximum tolerable latency, and the number of peers to be selected, are specified as either equality or inequality constraints, fuzzy scores are employed as the objective function coefficients in the linear programming model. In order to evaluate the performance of the proposed system, traditional peer selection algorithms mentioned in the previous section is used. It is performed through the following measures.

Table 1:

Simulation settings.

Parameter	Value
Connection type	ADSL
Bandwidth availability	2 mbps
File size	1 Gb
Block size	1 Mb
Neighbor peer count	10 to 15 nodes
Number of nodes	100 to 600 nodes

DOI: 10.7717/peerj-cs.2701/table-1

Table 2:

Bandwidth availability of each peer.

Bandwidth for download (Mbps)	Bandwidth for upload (Mbps)	Number of active nodes
128	128	50
256	128	50
384	256	50
512	256	100
640	256	100
1,280	384	100

DOI: 10.7717/peerj-cs.2701/table-2

Download time: It is a time required to download the necessary chunk of video content and it is a crucial measure to assess the performance of the peer selection algorithm. It’s measured in milliseconds (ms).
Download completion rate (DCR): It is another significant measure to assess the performance of the peer selection mechanism. FLP is used to help in selecting the best possible and available set of peers to download a desired chunk of content, reduces the probability of partial or failed download.
Network overhead: It is defined as the amount of content exchanged among the peers. It is another important measure for assessing the efficiency of the selection algorithm. FLP optimize the selection of peers by minimizing the amount of redundant data transfer, lead to lower network overhead.
Availability: The availability of a peer is defined as the percentage of time the peer is online and able to send the content. Peers with low availability are not preferred since they are likely to be able to provide video content when requested.
Upload capacity: The upload capacity of a peer is the maximum amount of content which can be transferred to other peers by any peer at any particular time slot. Peers with high upload capacity need to be chosen since they are able to provide content to several peers simultaneously, reducing overall download time for all peers.

In order to use FLP to solve the problem of peer selection, each of these metrics is assigned a membership function which differentiates the degree to which each peer satisfies the metric. These membership functions is then used to construct a fuzzy linear programming problem that seeks to maximize a weighted sum of the metrics, subject to constraints on the number of peers selected and the total bandwidth available. The solution to this problem provide a set of all peers which are optimal for downloading content from, based on the specified performance metrics and their respective weights. Table 3 outlines the mean upload and downloads bandwidth utilization percentages of the network across various numbers of nodes, displaying the values for the best run of the simulation. Packet redundancy is vital performance metric. It is used to assess the effectiveness of the proposed work. The number of times slots for any particular chunk is replicated across the network is defined as packet redundancy. The proposed work aims to reduce redundant transmission of same packets by choosing the best set of possible peers to download based peer and content availability, network congestion, and replication level of the packet. The performance evaluation of the proposed framework with respect to redundant packet transmission has been done and compared the packet replication levels with that of conventional methods. The simulations have been designed to vary the number of peers, the size of content, and the network conditions at any particular time slot. Apart from analyzing the levels of packet replication, the impact of packet redundancy on other performance measures such as download time and network congestion is also evaluated. The introduced framework enhances the overall performance of the network by reducing the amount of network traffic and also enhancing the download time with help of minimizing the redundant packet transmission. The performance analysis with respect to packet redundancy is illustrated in Fig. 7. It presents insights into the efficiency of the proposed framework in controlling redundant packet transmission as well as improving the overall performance of network. In the proposed framework, download time is another significant performance measure which reflects the time taken by any of the peer to download any particular video content. The performance of this work is assessed in terms of the download time. It is the time taken by a peer to download a particular video content. The performance of the framework has been analyzed and evaluated by comparing the download time of video content with that of traditional peer selection methods as mentioned above. The simulation have been modeled and designed to change the number of peers, the size of content, and the network conditions such as bandwidth and also latency. The performance analysis with respect to download time presents insights into the effectiveness of the proposed framework in addressing the peer selection. It is represented in Fig. 8.

Table 3:

Utilization of bandwidth capacity.

Number of peers	% of utilization of upload capacity of the peers				% of utilization of download capacity of the peers
Number of peers	RPS	Choke	Heuristics	FLPPS	RPS	Choke	Heuristics	FLPPS
100	94.2	94.3	94.5	96.1	39.5	39.8	41.5	42.5
150	94.3	94.8	95.3	96.5	41.2	41.5	42.1	42.8
200	94.5	95.1	95.7	96.6	42.2	42.6	42.8	43.2
250	94.9	95.2	96.7	97.1	42.5	42.8	42.9	43.6
300	95.2	95.3	96.8	97.5	42.8	43.1	43.5	44.5
350	95.6	95.5	97.2	97.8	42.9	43.4	43.8	44.9
400	95.9	96.2	97.5	98.1	43.1	43.5	44.2	45.1
450	96.1	96.5	97.6	98.5	43.5	44.1	44.5	45.5
500	96.5	96.8	98.2	98.6	43.6	44.2	44.8	45.6
600	96.9	97.2	98.5	99.9	43.8	44.1	45.5	46.5

DOI: 10.7717/peerj-cs.2701/table-3

Figure 7: Packet redundancy measure.

Download full-size image

DOI: 10.7717/peerj-cs.2701/fig-7

Figure 8: Download time measure.

Download full-size image

DOI: 10.7717/peerj-cs.2701/fig-8

The performance evaluation of proposed work has also been done using DCR. It is depicted in Fig. 9. The ratio between the number of the chunks of content that have been successfully downloaded by end user and the total number of chunks of the entire content is defined as DCR. Therefore, a higher DCR is needed to achieve the higher performance and the proposed framework also achieves the same. The performance is also further evaluated in terms of its ability to pick the most suitable peers to approach for downloading the content. It can be carried out by analyzing the DCR for different number of peers. The average DCR in the proposed system for all the peers is higher value. The performance of a P2P CDN is enhanced by restricting the network overhead caused by peer selection process. It is illustrated in Fig. 10. The FLP based one is proved to be an efficient and feasible method to solve the problem of peer selection with respect to network overhead. It provides a simple yet effective technique for controlling the network overhead at the same time optimizing the overall performance of the network. It also allows for the selection of peers using their availability and capacity, which aids to minimize the amount of network overhead drastically. It happens since choosing the peers with high availability and also capacity at the same time network is able to disseminate the content more efficiently and quickly. The effectiveness in selecting peers is evaluated with sufficient amount of resources to contribute for content distribution by assessing the performance of the framework based on upload capacity measure. This measure provides some insights into the proposed framework’s capability to optimize and effective usage of available resources, enhances content dissemination speed, and also improves the overall performance.

Figure 9: Download completion rate measure.

Download full-size image

DOI: 10.7717/peerj-cs.2701/fig-9

Figure 10: Network overhead computation.

Download full-size image

DOI: 10.7717/peerj-cs.2701/fig-10

Statistical tests

In establishing the improvements in the proposed optimal peer selection system, statistical tests such as confidence interval and p-value tests are carried out to determine the reliability of the observed performance improvement through simulation on different runs. With the use of confidence intervals and p-values, there is a possibility to confirm externally whether the FLP-based peer selection approach indeed results in significant and real performance gains. Confidence intervals (CIs) are a statistical technique for making estimates of a range within which a specified parameter may be expected to lie at a certain level of confidence (usually 95% or 99%). Confidence intervals are useful in assessing the validity of a specific proposed approach in the sense of establishing the stability of the observed changes. Table 4 represents validation of confidence intervals on the improvements from an FLP-based approach in a P2P video content distribution network. Table 5 represents validation of p-values for the improvements from an FLP-based approach in a P2P video content distribution.

Table 4:

Confidence interval measure.

Metric	Observed mean improvement	Standard deviation	Sample size (n)	Confidence level	Z-Score/t-Score	Confidence Interval (CI)	Interpretation
Content retrieval latency	10 ms	2 ms	50	95%	1.96	[9.45–10.55 ms]	We are 95% confident that the true average latency reduction is between 9.45 and 10.55 ms.
Network throughput	20% improvement	3%	40	99%	2.58	[18.8–21.2%]	With 99% confidence, the true throughput improvement lies between 18.8% and 21.2%.
Server idle time reduction	15% decrease	4%	30	95%	2.04 (t-score)	[13.5–16.5%]	There is a 95% confidence that the system reduces idle time by 13.5% to 16.5%.
Resource utilization efficiency	25% improvement	5%	45	90%	1.645	[23.2–26.8%]	At 90% confidence, resource utilization efficiency gains fall within 23.2% to 26.8%.

DOI: 10.7717/peerj-cs.2701/table-4

Table 5:

p-values.

Metric	Hypothesis tested	Observed test statistic	Degrees of freedom	p-value	Significance level (α)	Decision	Interpretation
Content retrieval latency	Null: No reduction in latency	2.45	49	0.020	0.05	Reject null hypothesis	The reduction in latency is statistically significant, suggesting the FLP approach improves retrieval times.
Network throughput	Null: No improvement in throughput	3.10	39	0.002	0.01	Reject null hypothesis	The increase in throughput is statistically significant, indicating effective optimization through the proposed method.
Server idle time reduction	Null: No reduction in server idle time	1.85	29	0.075	0.05	Fail to reject null hypothesis	The reduction in idle time is not statistically significant at the 5% level, suggesting variability in results.
Resource utilization efficiency	Null: No improvement in resource utilization	4.25	44	0.0005	0.01	Reject null hypothesis	The improvement in resource utilization is highly significant, supporting the efficacy of the FLP approach.

DOI: 10.7717/peerj-cs.2701/table-5

Result and discussion

The use of FLP in peer selection policy in P2P content distribution networks signifies remarkable progress compared to conventional and heuristic-based techniques. Our research, which included systematic simulations and comparative evaluation, has demonstrated the effectiveness of this proposed strategy. The metrics such as download duration, download completion rate, network overhead, packet redundancy, and the utilization of upload and download capabilities are evaluated. Contrasting the traditional and heuristic mechanism, the experimental findings, as described in the previous sections, reveal that FLP-based peer selection surpass random selection of peer, choke algorithms, and several heuristic methods. FLP has consistently confirmed to enhance the download speeds and completion rates across various network sizes, spanning from the number of peers from 100 to 600. The enhancement is attributed to proposed FLP’s ability to incorporate several decision-making criteria concurrently, taking into consideration of the uncertain nature of network dynamics and peer capabilities. The impact of FLP on network overhead and packet redundancy further emphasizes its superiority. The FLP technique reduces unnecessary data transfers in the network by carefully selecting peers based on their availability, bandwidth, and storage capacity. This optimization not only minimizes the amount of data transmitted but also reduces the burden on individual peers, therefore enhancing the overall network performance and user experience. An effect on network performance metrics, the analysis of bandwidth capacity, as outlined in our simulations, reveals the FLP’s capacity to optimize the exploitation of accessible resources. Peers selection using FLP demonstrated better levels of upload and download capacity utilization in comparison to peers selected using alternative and traditional approaches. FLP minimizes packet redundancy, resulting in decreased data transfer load and mitigating bottlenecks, therefore improving the efficiency of content delivery. The implementation of FLP presents a novel method to deal with the uncertainties that naturally exist in P2P networks. It presents a well-organized yet flexible system for dealing with the several factors involved in selecting peers while also balancing efficiency and fairness in allocating available resources. Implementing the FLP based peer selection method can be used to greatly enhance the performance of P2P CDN. It is a most powerful and hopeful technique and an alternate technique for choosing peers, although it does have some drawbacks. The computational complication of solving FLP problem, mainly in broad networks, presents a difficult problem. A potential future research opportunity would be evaluating and examining the formation of more streamlined algorithms for handling FLP problems and integrating some of the machine learning techniques to forecast optimal peer selections from past historical data. Additionally, future research might explore the impact and influence the performance of other fuzzy membership functions and decision making criteria on the efficiency of the FLP framework.

Time complexity analysis of the proposed algorithm

The time complexity of the proposed approach is primarily driven by the steps of peer evaluation, FLP formulation, and solving the optimization problem. It is carried out in this section. Each peer $P_{i}$ has $m$ attributes. The fuzzy membership function for a given attribute $a_{j}$ is:

(10) $μ_{a j} (P_{i}) = f_{a j} (x_{i}), \forall_{i} = 1, \dots, n a n d j = 1, \dots, m$ where $f_{a j} (x_{i})$ is a fuzzy membership function that maps the attribute value $x_{i}$ of peer $P_{i}$ to a membership degree $μ_{a j} (P_{i})$ . This operation is typically done using a mathematical function such as trapezoidal. The time complexity for calculating the fuzzy membership for each peer and each attribute is $O (1),$ since it involves a simple evaluation of the fuzzy function. Thus, the total time complexity for computing the fuzzy membership values across all $n$ peers and $m$ attributes is:

(11) $T_{f u z z y m e m b e r s h i p} (n, m) = O (n . m)$

For each attribute $a_{j}$ and each peer $P_{i}$ , we perform normalization as follows:

(12) $μ_{a j}^{'} (P_{i}) = \frac{μ_{a j} (P_{i}) - μ_{m i n} (a_{j})}{μ_{m a x} (a_{j}) - μ_{m i n} (a_{j})}, \forall i = 1, \dots, n a n d j = 1, \dots, m$

The time complexity for normalization operation is:

(13) $T_{n o r m a l i z a t i o n} (n, m) = O (n . m)$

The objective function $f (x)$ is typically a weighted sum of the fuzzy membership values of various attributes.

(14) $f (x) = \sum_{i = 1}^{n} (α_{i} . μ_{b a n d w i d t h} (P_{i}) - β_{i} . μ_{l a t e n c y} (P_{i})) + γ . μ_{a v a i l a b i l i t y} (P_{i})$ where $α_{i}, β_{i} a n d γ$ are weights for the respective attributes and $μ_{b a n d w i d t h} (P_{i}), μ_{l a t e n c y} (P_{i}), μ_{a v a i l a b i l i t y} (P_{i})$ are the normalized fuzzy membership values for bandwidth, latency, and availability for peer $P_{i}$ . Let the set of constraints for the linear program be defined as:

(15) $A . x \leq b,$ where $A$ is an $L * p$ matrix of constraints (typically, $L = n$ ), $x$ is a vector of decision variables (i.e., the selection decision for each peer), $b$ is a vector of bounds for the constraints. The time complexity for formulating the FLP problem is:

(16) $T_{f o r m u l a t i o n} (n, m) = O (n . m)$

Using the Simplex algorithm, the general time complexity for solving a linear programming problem is

(17) $T_{L P s o l v e r} (n) = O (n^{3})$

This cubic complexity arises due to the iterative nature of solving linear programs, the matrix operations and simplex (pivoting) method.

(18) $T_{p e e r s e l e c t i o n} (n) = O (n)$

Overall time complexity of the algorithm:

(19) $T (n, m) = O (n . m) + O (n . m) + O (n . m) + O (n^{3}) + O (n)$

The $O (n^{3})$ term dominates the overall time complexity, as it is the largest term. Therefore, the overall time complexity of the algorithm is:

(20) $T (n, m) = O (n^{3}) .$

Granular computing

There is a possibility to enhance our research with incorporation of the granular computing-based decision models in our future work. Granular computing (GrC) is a powerful computational paradigm used to solve complex problems by breaking them down into granules where the information packets of varying sizes and also levels of abstraction. These granules are constructed based on the natural clustering of data, which is particularly useful in decision-making processes that involve imprecise, uncertain, or incomplete information. In the context of P2P networks, GrC-based models can help to manage the inherent uncertainty and heterogeneity of peers, network conditions, and resource availability. By using GrC, decision models can classify peers into various granules based on their attributes, such as bandwidth, reliability, and availability. These granules help to simplify and structure complex decision processes by abstracting the detailed peer characteristics into more manageable clusters, which allows for more effective and scalable decision-making. In P2P video content distribution, GrC-based models could improve the peer selection process by creating decision-making hierarchies that consider multiple attributes at different levels of granularity. While our current approach primarily focuses on FLP to address uncertainties, GrC-based models share a similar objective of handling complexity and uncertainty in decision-making. In future extensions of our work, GrC could be integrated with FLP to further enhance the peer selection process by creating more robust and scalable solutions that incorporate both granular abstraction and fuzzy decision-making.

Conclusion

An efficient system is introduced in this article to deal with the problem of peer selection in P2P content distribution network using fuzzy linear programming. It combines the benefits of fuzzy sets and linear programming technique to work and deal with the uncertainty and imprecision in the selection criteria along with decision variables. It also consists of a sequence of steps which converts the FLP problem to an LP problem with correct constraints using alpha cuts; then, it can be solved using typical and simple linear programming algorithms. The objective function is devised based on the fuzzy decision variables and selection criteria to exploit the efficiency of content distribution. The membership functions for the fuzzy decision variables characterize the degree to which each peer satisfies the related criterion. The framework is used to obtain solution for large scale problems with a large number of peers and some more criterions for selection. In addition, it is easily extendable to include additionally new selection criteria and also permit to modify the existing criteria. The proposed framework has the potential to address the peer selection problem in P2P content distribution network. It has the ability to enhance the performance and reliability of P2P content video distribution in network, and can be used in various contexts such as large scale file sharing, multimedia streaming, and other content delivery in networks. Efficient caching of content at some of the selected peers may reduce the load on the network and improve the delivery of content. Hence, research into optimal caching policies and replacement approaches are part of our future research in this domain.

[1] Ali MS, Vecchio M, Putra GD, Kanhere SS, Antonelli F. 2020. A decentralized P2P remote health monitoring system. Sensors 20(6):1656

[2] Anandaraj M, Ganeshkumar P, Naganandhini S, Selvaraj K. 2024. A novel fuzzy programming approach for piece selection problem in P2P content distribution network. PeerJ Computer Science 10(6):e1645

[3] Anandaraj M, Selvaraj K, Ganeshkumar P, Rajkumar K, Sriram S. 2021. Genetic algorithm-based resource minimization in network code-Based P2P network. Journal of Circuits, Systems, and Computers 30(8):2150092

[4] Budhkar S, Tamarapalli V. 2020. An overlay management strategy to improve QoS in CDN-P2P live streaming systems. Peer-to-Peer Networking and Applications 13(1):190-206

[5] D’Alessandro Costa MA, Gonçalves Rubinstein M. 2019. Performance analysis of a locality-aware BitTorrent protocol in enterprise networks. Peer-to-Peer Networking and Applications 12(4):751-762

[6] Edalatpanah SA. 2023. Multidimensional solution of fuzzy linear programming. PeerJ Computer Science 9(1):e1646

[7] Farahani R, Çetinkaya E, Timmerer C, Shojafar M, Ghanbari M, Hellwagner H. 2023. ALIVE: a latency- and cost-aware hybrid P2P-CDN framework for live video streaming. IEEE Transactions on Network and Service Management 21(2):1561-1580

[8] Gebraselase BG, Helvik BE, Jiang Y. 2022. Bitcoin P2P network measurements: a testbed study of the effect of peer selection on transaction propagation and confirmation times. IEEE Transactions on Network and Service Management 19(4):3975-3987

[9] Geng J, Fujita S. 2024. Enhancing crowd-sourced video sharing through P2P-assisted HTTP video streaming. Electronics 13(7):1265-1275

[10] Ghasemkhani H, Li Y-M, Moinzadeh K, Tan Y. 2018. Contracting models for P2P content distribution. Production and Operations Management, Production and Operations Management Society 27(11):1940-1959

[11] Goguen JA, Zadeh LA. 1973. Fuzzy sets. Information and control, vol. 8 (1965), pp. 338–353. - L. A. Zadeh. Similarity relations and fuzzy orderings. Information sciences, vol. 3 (1971), pp. 177–200. Information and Control 38(4):656-657

[12] Han SC, Nam KW. 2019. An optimal peer selection algorithm for mesh-based Peer-to-Peer networks. KSII Transactions on Internet and Information Systems 13(1):133-151

[13] Hwang IS, Rianto A, Kharga R, Ab-Rahman MS. 2022. Global P2P BitTorrent real-time traffic over SDN-based local-aware NG-PON2. IEEE Access 10:76884-76894

[14] Izhak-Ratzin R, Park H, Schaar MVD. 2012. Online learning in BitTorrent systems. IEEE Transactions on Parallel and Distributed Systems 23(12):2280-2288

[15] Kumar D, Pandey M. 2022. An optimal and secure resource searching algorithm for unstructured mobile peer-to-peer network using particle swarm optimization. Applied Intelligence 52(13):14988-15005

[16] Liu Y, Sakamoto S, Matsuo K, Ikeda M, Barolli L, Xhafa F. 2016. A comparison study for two fuzzy-based systems: improving reliability and security of JXTA-overlay P2P platform. International Journal of Soft Computing 20(7):2677-2687

[17] Loganathan T, Ganesan K. 2021. Solution of fully Fuzzy Linear Fractional Programming Problem-A Simple Approach. IOP Conference Series: Materials Science and Engineering 1130:012047

[18] Luo S, Yu H, Li K, Xing H. 2020. Efficient file dissemination in data center networks with priority-based adaptive multicast. IEEE Journal on Selected Areas in Communications 38(6):1161-1175

[19] Masood S, Shahid MA, Sharif M, Yasmin M. 2018. Comparative analysis of Peer to Peer network. International Journal of Advanced Networking and Applications 9(4):3477-3491

[20] Meng XF. 2018. SpeedTrust: a super peer-guaranteed trust model in hybrid P2P networks. The Journal of Supercomputing 74(6):2553-2580

[21] Nacakli S, Tekalp AM. 2021. Controlling P2P-CDN live streaming services at SDN-enabled multi-access edge datacenters. IEEE Transactions on Multimedia 23:3805-3816

[22] Naganandhini S, Shanthi D. 2023. Optimizing replication of data for distributed cloud computing environments: techniques, challenges, and research gap.

[23] Nasseri SH, Verdegay JL, Mahmoudi F. 2021. A new method to solve fuzzy interval flexible linear programming using a multi-objective approach. Fuzzy Information and Engineering 13(2):248-265

[24] Nguyen A-T, Taniguchi T, Eciolaza L, Campos V, Palhares R, Sugeno M. 2019. Fuzzy control systems: past, present and future. IEEE Computational Intelligence Magazine 14(1):56-68

[25] Nie L, Yang S, Zheng X, Wang X. 2023. An efficient and adaptive content delivery system based on hybrid network. IEEE Transactions on Broadcasting 69(4):904-915

[26] Ren Y, Zeng Z, Wang T, Zhang S, Zhi G. 2020. A trust-based minimum cost and quality aware data collection scheme in P2P network. Peer-to-Peer Networking and Applications 13(6):2300-2323

[27] Rivaz S, Nasseri SH, Ziaseraji M. 2020. A fuzzy goal programming approach to multiobjective transportation problems. Fuzzy Information and Engineering 12(2):139-149

[28] Safara F, Souri A, Deiman SF. 2020. Super peer selection strategy in peer-to-peer networks based on learning automata. International Journal of Communication Systems 33(6):49

[29] Surati S, Jinwala DC, Garg S. 2017. A survey of simulators for P2P overlay networks with a case study of the P2P tree overlay using an event-driven simulator. Engineering Science and Technology an International Journal 20(2):705-720

[30] Villacorta P, Rabelo C, Pelta D, Verdegay J. 2017. FuzzyLP: an r package for solving fuzzy linear programming problems. In: Kacprzyk J, Filev D, Beliakov G, eds. Granular, Soft and Fuzzy Approaches for Intelligent Systems: Dedicated to Professor Ronald R. Yager, chapter 11, 209–230. Berlin, Germany: Springer International Publishing. 11:209-230

[31] Wang Z, Baydaş M, Stevic Ž, Ozçil A, Irfan SA, Wu Z, Rangaiah GP. 2023. Comparison of fuzzy and crisp decision matrices: an evaluation on PROBID and sPROBID multi-criteria decision-making methods. Demonstratio Mathematica 56(1):20230117

[32] Xue B, Mao Y, Venkatakrishnan SB, Kannan S. 2023. Goldfish: peer selection using matrix completion in unstructured P2P network.

[33] Yang X-P, Zheng G. 2021. Maximum number of line faults in a P2P network system based on the addition-min fuzzy relation inequalities. IEEE Transactions on Fuzzy Systems 30(7):2241-2253