首页期刊导航|Future generation computer systems: FGCS
期刊信息/Journal information
Future generation computer systems: FGCS
Elsevier Science
Future generation computer systems: FGCS

Elsevier Science

0167-739X

Future generation computer systems: FGCS/Journal Future generation computer systems: FGCSEI
正式出版
收录年代

    Run time dynamic digital twins and dynamic digital twins networks

    Alexander VodyahoRadhakrishnan DelhibabuDmitry I. IgnatovNataly Zhukova...
    107823.1-107823.18页
    查看更多>>摘要:Digital twins are widely used for building various types of cyber-physical systems. There are a huge number of publications devoted to the use of digital twins in production systems. Much less attention is paid to th4e issues of building runtime digital twins. The article describes an approach to building complex distributed cyber-physical systems with a high level of architectural dynamics built on fog and edge computing platforms based on the use of digital twins. The issues of implementing runtime digital twins and distributed systems of runtime digital twins are considered. The requirements to runtime digital twins are defined. Typical problem statements for constructing and maintaining a runtime digital twin system are formulated. A reference architecture of a dynamic runtime digital twin is proposed, which includes a model of the observed system (or the object) and a model processor. The dynamic model of the observed and managed system is considered as a key element of the digital twin. Possible approaches to the synthesis of built-in models of runtime digital twins are discussed. Examples of using the proposed approach to solve practical problems are given. The described approach may be of interest to specialists involved in research and development of various types of information systems implemented on Internet of Things platforms, such as smart cities, smart transport, medical information systems, etc. It is proposed to conduct further research and development in the areas of creating human digital twins.

    Evaluating privacy loss in differential privacy based federated learning

    Shangyin WengYan GouLei ZhangMuhammad Ali Imran...
    107848.1-107848.10页
    查看更多>>摘要:Federated learning (FL) trains a global model by aggregating local training gradients, but private information can be leaked from these gradients. To enhance privacy, differential privacy (DP) is often used by adding artificial noise. However, this approach reduces accuracy compared to noise-free learning. Balancing privacy protection and model accuracy remains a key challenge for DP-based FL. Additionally, current methods use theoretical bounds to measure privacy loss, lacking an intuitive assessment. In this paper, we first propose an evaluation method for privacy leakage in the FL by utilizing reconstruction attacks to analyze the difference between the original images and reconstructed ones. We then formulate the problems of investigating DP's effect on the reconstruction attack, where we study the accumulative privacy loss under two different reconstruction attack settings and prove that anonymous local clients can decrease the probability of privacy leakage. Next, we study the effects of different clipping methods, including fixed constants and the median value of the unclipped gradients' norm, on privacy protection and learning performance. Furthermore, we derive the theoretical convergence analysis for the cosine similarity and l_2 -norm-based reconstruction attack under DP noise. We conduct extensive simulations to show how DP settings affect privacy leakage and characterize the trade-off between privacy protection and learning accuracy.

    AS2: Adaptive sorting algorithm selection for heterogeneous workloads and systems

    Sangmyung LeeByungyoon LeeYongseok SonKiwook Sohn...
    107860.1-107860.15页
    查看更多>>摘要:Sorting is becoming increasingly important in modern computing, ranging from small-scale Internet of Things (IoT) devices to supercomputers. To improve sorting performance, various algorithms, including Intro sort, Merge sort, Heap sort, and Insertion sort, are adopted in different systems. However, the performance of sorting algorithms depends on various factors, and our analysis shows that the optimal algorithm varies, with no single algorithm consistently outperforming the others. In this paper, we first analyze data internal factors (data size, distribution, data type) and external factors (threads, different hardware) that impact sorting algorithm performance. We utilize widely adopted sorting algorithms such as STL sort and Merge sort, as well as state-of-the-art sorting algorithms like Ips4o sort and Aips2o sort. In addition to sequential sorting algorithms, we implement Parallel Intro sort and utilize the parallel versions of state-of-the-art sorting algorithms with varying number of threads. From the analysis, we present an adaptive sorting algorithm selection model for heterogeneous workloads and systems, called AS2 (Adaptive Sorting Algorithm Selection). Its goal is to determine the optimal algorithm from the existing sorting algorithms in heterogeneous workloads and systems. AS2 uses various ML models to build performance models for each sorting algorithm using data internal and external factors from various datasets. Then, AS2 chooses the optimal sorting algorithm based on the performance prediction using the model. We evaluate AS2 using a representative dataset that includes various data internal and external factors. The results show that AS2 can accurately predict the performance of various sorting algorithms, with min and max r-squared values of 0.83 and 0.99, respectively. In addition, AS2 successfully selects the optimal algorithm in our evaluation scenario up to 99.68% accuracy by choosing the algorithm with the shortest predicted sorting time, improving performance by up to 1.83× compared to the state-of-the-art algorithm. We also evaluate the performance of AS2 using the real-world dataset and the results show that AS2 selects the optimal algorithm with 87.50% accuracy.

    Multi-agent deep reinforcement learning based multi-task partial computation offloading in mobile edge computing

    Han LiShunmei MengJin SunZhicheng Cai...
    107861.1-107861.11页
    查看更多>>摘要:Mobile edge computing (MEC) can enhance the computation performance of end-devices by providing computation offloading service at the network edge. However, given that both end-devices and edge servers have finite computation resources, inefficient offloading policies may lead to overload, thereby increasing the computation delays of tasks. In this paper, we investigate a multi-task partial computation offloading problem combined with a queue model. Based on achieving load-balancing across the MEC system, our objective is to minimize the long-standing average task-processing cost of the end-devices while ensuring the delay thresholds of tasks. For this purpose, a distributed offloading algorithm utilizing the multi-agent deep reinforcement learning (MADRL) method is proposed. Specifically, through interacting with the MEC environment and accumulating experience data, the device agents can collaborate to optimize their local offloading decisions over continuous time-slots, which includes adjusting the transmission power and determining the tasks' offloading ratios under the dynamic wireless channel conditions. Exhaustive experimental results demonstrate that in contrast with the baseline algorithms, the proposed offloading algorithm can not only better balance the computation loads between the end-devices and the MEC server, but also more effectively reduce the task-processing cost of the end-devices, as well as the percentage of timeout tasks.

    A multi-agent architecture for context sources integration in smart cities

    Leonardo Vianna do NascimentoJose Palazzo Moreira de Oliveira
    107862.1-107862.15页
    查看更多>>摘要:Contextual data in smart cities are present in large quantities and distributed sources. Many applications can benefit from these data to provide better services to their users. The scale and dynamic nature of urban environments pose significant challenges in making context sources available to applications. These challenges involve transparent access to context, resilience, decentralization, extensibility, scalability, and redundancy of data. This study introduces a new architecture designed to address these issues. This architecture aims to facilitate the acquisition of context by integrating distributed data sources. The developed architecture not only overcomes the challenges posed by the scale and dynamicity of urban environments but also prepares for more innovative and effective solutions for smart cities. The architecture is distributed, decentralized, and fault-tolerant, providing data fusion mechanisms and dynamic context source composition. Compared to existing works, our architecture contributes to the state-of-the-art addressing all these five challenges in one design. The architecture uses the multi-agent paradigm, which is inherently distributed and facilitates decentralization. A scenario was used to execute several experiments demonstrating that the architecture can obtain context data transparently by any application.

    Harnessing quality-throughput trade-off in scoring functions for extreme-scale virtual screening campaigns

    Yuedong ZhangGianmarco AccordiDavide GadioliGianluca Palermo...
    107863.1-107863.13页
    查看更多>>摘要:Drug discovery is a long and costly process aimed at finding a molecule that yields a therapeutic effect. Virtual screening is one of the initial in-silico steps that aims at estimating how promising a molecule is. This stage needs to solve two well-known domain problems: molecular docking and scoring. While the accuracy of scoring functions is extensively investigated in comparisons, the execution time of their implementation is usually not considered. In virtual screening campaigns, the definition of a fixed time budget for the entire process and the average time required to process each molecule determines the upper limit of the number of molecules that can be evaluated. By reducing the time needed to evaluate a single molecule, we can screen a larger number of molecules, thereby increasing the possibility of finding a promising solution. For extreme-scale virtual screening campaigns, the computational budget is a critical aspect since even utilizing large-scale facilities would make it impractical to complete the screening within a feasible time unless the computational time for a single molecule is significantly reduced. In this paper, we explore optimization and approximation techniques applied to two well-known scoring functions, which we modify to investigate different accuracy-performance trade-offs to support large-scale virtual screening campaigns. Despite the different approaches we considered, experimental results demonstrate that the proposed enhancements achieve better enrichment factors in virtual screening scenarios. Moreover, we port both implementations to CUDA to show that the proposed techniques are GPU-friendly and aligned with modern supercomputing infrastructures.

    X-DINC: Toward Cross-Layer Approximation for the Distributed and In-Network ACceleration of Multi-Kernel Applications

    Zahra EbrahimiMaryam EslamiXun XiaoAkash Kumar...
    107864.1-107864.16页
    查看更多>>摘要:With the rapid evolution of programmable network devices and the urge for energy-efficient and sustainable computing, network infrastructures are mutating toward a computing pipeline, providing In-Network Computing (INC) capability. Despite the initial success in offloading single/small kernels to the network devices, deploying multi-kernel applications remains challenging due to limited memory, computing resources, and lack of support for Floating Point (FP) and complex operations. To tackle these challenges, we present a cross-layer approximation and distribution methodology (X-DINC) that exploits the error resilience of applications. X-DINC utilizes a chain of techniques to facilitate kernel deployment and distribution across heterogeneous devices in INC environments. First, we identify approximation and optimization opportunities in data acquisition and computation phases of multi-kernel applications. Second, we simplify complex arithmetic operations to cope with the computation limitations of the programmable network switches. Third, we perform application-level sensitivity analysis to measure the trade-off between performance gain and Quality of Results (QoR) loss when approximating individual kernels via various techniques. Finally, a greedy heuristic swiftly generates Pareto/near-Pareto mixed-precision configurations that maximize the performance gain while maintaining the user-defined QoR. X-DINC is prototyped on a Virtex-7 Field Programmable Gate Array (FPGA) and evaluated using the Blind Source Separation (BSS) application on industrial audio dataset. Results show that X-DINC performs separation up to 35% faster with up to 88% lower Area-Delay Product (ADP) compared to an Accurate-Centralized approach, when distributed across 2 to 7 network nodes, while maintaining audio quality within an acceptable range of 15-20 dB.

    Adaptive container auto-scaling for fluctuating workloads in cloud

    Xiaoyue FengSijia ZhangTianzhe JiaoChaopeng Guo...
    107872.1-107872.13页
    查看更多>>摘要:Database-as-a-Service(DBaaS) provides services for multiple tenants through resource containers, which are allowed to scale over time to fulfill the service-level agreements. Designing container auto-scaling methods for DBaaS can help reduce their expenditure. Reinforcement Learning (RL) shows powerful performance in cloud resource scaling due to its robustness in dynamic environments. However, the RL-based methods fail to maintain high performance for fluctuating workloads since their fixed-action design cannot adapt to numerous variations of the resource demand. This paper proposes an adaptive container auto-scaling method called Asner that includes an improved RL-based algorithm with a dynamic action model to solve the problem of fixed-action design. Asner consists of a resource estimation model (Estimator) and a RL-based scaling algorithm (Scaler). Estimator adopts a graph-based method to estimate the workload resource demand for container scaling. Scaler generates the container scaling strategy by employing an improved RL-based algorithm with a dynamic action model for adapting to the fluctuating workload. Our experiment results show that Estimator achieves about 93% accuracy under the TPC-DS dataset, Scale's performance is about 30% higher than the state-of-the-art RL, and Asner improves its performance by up to 45% compared to other methods.

    Improving self-supervised vertical federated learning with contrastive instance-wise similarity and dynamical balance pool

    Shuai ChenWenyu ZhangXiaoling HuangCheng Zhang...
    107884.1-107884.14页
    查看更多>>摘要:Vertical Federated Learning (VFL) enables multiple parties with distinct feature spaces to train a joint VFL model collaboratively without exposing their original private data. In realistic scenarios, the scarcity of aligned and labeled samples among collaborating participants limits the effectiveness of traditional VFL approaches for model training. Current VFL frameworks attempt to leverage abundant unlabeled data using Contrastive Self-Supervised Learning (CSSL). However, the simplistic incorporation of CSSL methods cannot address severe domain shift in VFL. In addition, CSSL methods typically conflict with general regularization approaches designed to alleviate domain shift, thereby significantly limiting the potential of the self-supervised learning framework in VFL. To address these challenges, this study proposes an Improved Self-Supervised Vertical Federated Learning (ISSVFL) framework for VFL in label-scarce scenarios under the semi-honest and no-collusion assumption. ISSVFL merges CSSL with instance-wise similarity to resolve regularization conflicts and captures more significant inter-domain knowledge in the representations from different participants, effectively alleviating domain shift. In addition, a new dynamical balance pool is proposed to fine-tune the pre-trained models for downstream supervised tasks by dynamically balancing inter-domain and intra-domain knowledge. Extensive empirical experiments on image and tabular datasets demonstrate that ISSVFL achieves an average performance improvement of 3.3 % compared with state-of-the-art baselines.

    Efficient edge-based data integrity auditing in cloud storage

    Hao YanYan WangGuoxiu LiuJuan Zhao...
    107899.1-107899.8页
    查看更多>>摘要:Edge computing increasingly collaborates with cloud computing to support numerous applications that involve large data volumes and frequent data interactions. In cloud-edge collaboration environments, applications especially with high requirements for low data transmission delay often deploy frequently accessed client data replicas on edge servers to improve data access efficiency. Consequently, client data is often distributed across both cloud and edge servers in practice. Therefore, efficiently verifying the integrity of all client data poses a complex and urgent challenge. To address this issue, the paper introduces a novel data integrity auditing scheme capable of efficiently performing asynchronous integrity checks on client data across both edge and cloud servers. In our scheme, clients only generate partial block tags and upload them along with the data to the edge server. Edge server computes complete tags based on the partial tags, caches a small portion of frequently accessed data, and transfers the remaining data to the cloud server. For data verification, edge servers provide partial integrity proofs for cached data, supporting the cloud server to generate complete proofs for all challenged data. Thus, the auditors can verify all client data, regardless of its storage location. In our scheme, edge clients bear only about half of the computational workload of existing schemes. Additionally, the cloud server also offloads a portion of computational and storage tasks to edge servers, significantly improving the overall efficiency of data checking. We theoretically prove the security of our scheme, and experimental results demonstrate its efficiency and feasibility.