首页期刊导航|计算机科学前沿
期刊信息/Journal information
计算机科学前沿
高等教育出版社,Springer
计算机科学前沿

高等教育出版社,Springer

季刊

2095-2228

100029

北京市朝阳区惠新东街4号富盛大厦15层

计算机科学前沿/Journal Frontiers of Computer ScienceCSCDCSTPCDEISCI
查看更多>>涉及领域包括(但不限于)软件工程,计算机体系结构,程序设计理论,算法,人工智能,计算机图形学与虚拟现实,因特网,安全与密码学,生物信息,中文信息处理,智能人机接口,数据库以及相关新兴学科和交叉学科。
正式出版
收录年代

    Hybrid concurrency control protocol for data sharing among heterogeneous blockchains

    Tiezheng GUOZhiwei ZHANGYe YUANXiaochun YANG...
    1-12页
    查看更多>>摘要:With the development of information technology and cloud computing,data sharing has become an important part of scientific research.In traditional data sharing,data is stored on a third-party storage platform,which causes the owner to lose control of the data.As a result,there are issues of intentional data leakage and tampering by third parties,and the private information contained in the data may lead to more significant issues.Furthermore,data is frequently maintained on multiple storage platforms,posing significant hurdles in terms of enlisting multiple parties to engage in data sharing while maintaining consistency.In this work,we propose a new architecture for applying blockchains to data sharing and achieve efficient and reliable data sharing among heterogeneous blockchains.We design a new data sharing transaction mechanism based on the system architecture to protect the security of the raw data and the processing process.We also design and implement a hybrid concurrency control protocol to overcome issues caused by the large differences in blockchain performance in our system and to improve the success rate of data sharing transactions.We took Ethereum and Hyperledger Fabric as examples to conduct cross-blockchain data sharing experiments.The results show that our system achieves data sharing across heterogeneous blockchains with reasonable performance and has high scalability.

    A disk I/O optimized system for concurrent graph processing jobs

    Xianghao XUFang WANGHong JIANGYongli CHENG...
    13-29页
    查看更多>>摘要:In order to analyze and process the large graphs with high cost efficiency,researchers have developed a number of out-of-core graph processing systems in recent years based on just one commodity computer.On the other hand,with the rapidly growing need of analyzing graphs in the real-world,graph processing systems have to efficiently handle massive concurrent graph processing(CGP)jobs.Unfortunately,due to the inherent design for single graph processing job,existing out-of-core graph processing systems usually incur unnecessary data accesses and severe competition of I/O bandwidth when handling the CGP jobs.In this paper,we propose GraphCP,a disk I/O optimized out-of-core graph processing system that efficiently supports the processing of CGP jobs.GraphCP proposes a benefit-aware sharing execution model to share the I/O access and processing of graph data among the CGP jobs and adaptively schedule the graph data loading based on the states of vertices,which efficiently overcomes above challenges faced by existing out-of-core graph processing systems.Moreover,GraphCP adopts a dependency-based future-vertex updating model so as to reduce disk I/Os in the future iterations.In addition,GraphCP organizes the graph data with a Source-Sorted Sub-Block graph representation for better processing capacity and I/O access locality.Extensive evaluation results show that GraphCP is 20.5× and 8.9× faster than two out-of-core graph processing systems GridGraph and GraphZ,and 3.5× and 1.7× faster than two state-of-art concurrent graph processing systems Seraph and GraphSO.

    Accelerating BERT inference with GPU-efficient exit prediction

    Lei LIChengyu WANGMinghui QIUCen CHEN...
    31-42页
    查看更多>>摘要:BERT is a representative pre-trained language model that has drawn extensive attention for significant improvements in downstream Natural Language Processing(NLP)tasks.The complex architecture and massive parameters bring BERT competitive performance but also result in slow speed at model inference time.To speed up BERT inference,FastBERT realizes adaptive inference with an acceptable drop in accuracy based on knowledge distillation and the early-exit technique.However,many factors may limit the performance of FastBERT,such as the teacher classifier that is not knowledgeable enough,the batch size shrinkage and the redundant computation of student classifiers.To overcome these limitations,we propose a new BERT inference method with GPU-Efficient Exit Prediction(GEEP).GEEP leverages the shared exit loss to simplify the training process of FastBERT from two steps into only one step and makes the teacher classifier more knowledgeable by feeding diverse Transformer outputs to the teacher classifier.In addition,the exit layer prediction technique is proposed to utilize a GPU hash table to handle the token-level exit layer distribution and to sort test samples by predicted exit layers.In this way,GEEP can avoid batch size shrinkage and redundant computation of student classifiers.Experimental results on twelve public English and Chinese NLP datasets prove the effectiveness of the proposed approach.The source codes of GEEP will be released to the public upon paper acceptance.

    EvolveKG:a general framework to learn evolving knowledge graphs

    Jiaqi LIUZhiwen YUBin GUOCheng DENG...
    43-59页
    查看更多>>摘要:A great many practical applications have observed knowledge evolution,i.e.,continuous born of new knowledge,with its formation influenced by the structure of historical knowledge.This observation gives rise to evolving knowledge graphs whose structure temporally grows over time.However,both the modal characterization and the algorithmic implementation of evolving knowledge graphs remain unexplored.To this end,we propose EvolveKG-a general framework that enables algorithms in the static knowledge graphs to learn the evolving ones.EvolveKG quantifies the influence of a historical fact on a current one,called the effectiveness of the fact,and makes knowledge prediction by leveraging all the cross-time knowledge interaction.The novelty of EvolveKG lies in Derivative Graph-a weighted snapshot of evolution at a certain time.Particularly,each weight quantifies knowledge effectiveness through a temporarily decaying function of consistency and attenuation,two proposed factors depicting whether or not the effectiveness of a fact fades away with time.Besides,considering both knowledge creation and loss,we obtain higher prediction accuracy when the effectiveness of all the facts increases with time or remains unchanged.Under four real datasets,the superiority of EvolveKG is confirmed in prediction accuracy.

    Unsupervised social network embedding via adaptive specific mappings

    Youming GECong HUANGYubao LIUSen ZHANG...
    61-71页
    查看更多>>摘要:In this paper,we address the problem of unsuperised social network embedding,which aims to embed network nodes,including node attributes,into a latent low dimensional space.In recent methods,the fusion mechanism of node attributes and network structure has been proposed for the problem and achieved impressive prediction performance.However,the non-linear property of node attributes and network structure is not efficiently fused in existing methods,which is potentially helpful in learning a better network embedding.To this end,in this paper,we propose a novel model called ASM(Adaptive Specific Mapping)based on encoder-decoder framework.In encoder,we use the kernel mapping to capture the non-linear property of both node attributes and network structure.In particular,we adopt two feature mapping functions,namely an untrainable function for node attributes and a trainable function for network structure.By the mapping functions,we obtain the low dimensional feature vectors for node attributes and network structure,respectively.Then,we design an attention layer to combine the learning of both feature vectors and adaptively learn the node embedding.In encoder,we adopt the component of reconstruction for the training process of learning node attributes and network structure.We conducted a set of experiments on seven real-world social network datasets.The experimental results verify the effectiveness and efficiency of our method in comparison with state-of-the-art baselines.

    Uncertain knowledge graph embedding:an effective method combining multi-relation and multi-path

    Qi LIUQinghua ZHANGFan ZHAOGuoyin WANG...
    73-89页
    查看更多>>摘要:Uncertain Knowledge Graphs(UKGs)are used to characterize the inherent uncertainty of knowledge and have a richer semantic structure than deterministic knowledge graphs.The research on the embedding of UKG has only recently begun,Uncertain Knowledge Graph Embedding(UKGE)model has a certain effect on solving this problem.However,there are still unresolved issues.On the one hand,when reasoning the confidence of unseen relation facts,the introduced probabilistic soft logic cannot be used to combine multi-path and multi-step global information,leading to information loss.On the other hand,the existing UKG embedding model can only model symmetric relation facts,but the embedding problem of asymmetric relation facts has not be addressed.To address the above issues,a Multiplex Uncertain Knowledge Graph Embedding(MUKGE)model is proposed in this paper.First,to combine multiple information and achieve more accurate results in confidence reasoning,the Uncertain ResourceRank(URR)reasoning algorithm is introduced.Second,the asymmetry in the UKG is defined.To embed asymmetric relation facts of UKG,a multi-relation embedding model is proposed.Finally,experiments are carried out on different datasets via 4 tasks to verify the effectiveness of MUKGE.The results of experiments demonstrate that MUKGE can obtain better overall performance than the baselines,and it helps advance the research on UKG embedding.

    Incorporating contextual evidence to improve implicit discourse relation recognition in Chinese

    Sheng XUPeifeng LIQiaoming ZHU
    91-104页
    查看更多>>摘要:The discourse analysis task,which focuses on understanding the semantics of long text spans,has received increasing attention in recent years.As a critical component of discourse analysis,discourse relation recognition aims to identify the rhetorical relations between adjacent discourse units(e.g.,clauses,sentences,and sentence groups),called arguments,in a document.Previous works focused on capturing the semantic interactions between arguments to recognize their discourse relations,ignoring important textual information in the surrounding contexts.However,in many cases,more than capturing semantic interactions from the texts of the two arguments are needed to identify their rhetorical relations,requiring mining more contextual clues.In this paper,we propose a method to convert the RST-style discourse trees in the training set into dependency-based trees and train a contextual evidence selector on these transformed structures.In this way,the selector can learn the ability to automatically pick critical textual information from the context(i.e.,as evidence)for arguments to assist in discriminating their relations.Then we encode the arguments concatenated with corresponding evidence to obtain the enhanced argument representations.Finally,we combine original and enhanced argument representations to recognize their relations.In addition,we introduce auxiliary tasks to guide the training of the evidence selector to strengthen its selection ability.The experimental results on the Chinese CDTB dataset show that our method outperforms several state-of-the-art baselines in both micro and macro F1 scores.

    Scattering-based hybrid network for facial attribute classification

    Na LIUFan ZHANGLiang CHANGFuqing DUAN...
    105-116页
    查看更多>>摘要:Face attribute classification(FAC)is a high-profile problem in biometric verification and face retrieval.Although recent research has been devoted to extracting more delicate image attribute features and exploiting the inter-attribute correlations,significant challenges still remain.Wavelet scattering transform(WST)is a promising non-learned feature extractor.It has been shown to yield more discriminative representations and outperforms the learned representations in certain tasks.Applied to the image classification task,WST can enhance subtle image texture information and create local deformation stability.This paper designs a scattering-based hybrid block,to incorporate frequency-domain(WST)and image-domain features in a channel attention manner(Squeeze-and-Excitation,SE),termed WS-SE block.Compared with CNN,WS-SE achieves a more efficient FAC performance and compensates for the model sensitivity of the small-scale affine transform.In addition,to further exploit the relationships among the attribute labels,we propose a learning strategy from a causal view.The cause attributes defined using the causality-related information can be utilized to infer the effect attributes with a high confidence level.Ablative analysis experiments demonstrate the effectiveness of our model,and our hybrid model obtains state-of-the-art results in two public datasets.

    Constrained clustering with weak label prior

    Jing ZHANGRuidong FANHong TAOJiacheng JIANG...
    117-132页
    查看更多>>摘要:Clustering is widely exploited in data mining.It has been proved that embedding weak label prior into clustering is effective to promote its performance.Previous researches mainly focus on only one type of prior.However,in many real scenarios,two kinds of weak label prior information,e.g.,pairwise constraints and cluster ratio,are easily obtained or already available.How to incorporate them to improve clustering performance is important but rarely studied.We propose a novel constrained Clustering with Weak Label Prior method(CWLP),which is an integrated framework.Within the unified spectral clustering model,the pairwise constraints are employed as a regularizer in spectral embedding and label proportion is added as a constraint in spectral rotation.To approximate a variant of the embedding matrix more precisely,we replace a cluster indicator matrix with its scaled version.Instead of fixing an initial similarity matrix,we propose a new similarity matrix that is more suitable for deriving clustering results.Except for the theoretical convergence and computational complexity analyses,we validate the effectiveness of CWLP through several benchmark datasets,together with its ability to discriminate suspected breast cancer patients from healthy controls.The experimental evaluation illustrates the superiority of our proposed approach.

    Group control for procedural rules:parameterized complexity and consecutive domains

    Yongjie YANGDinko DIMITROV
    133-141页
    查看更多>>摘要:We consider GROUP CONTROL BY ADDING INDIVIDUALS(GCAI)in the setting of group identification for two procedural rules—the consensus-start-respecting rule and the liberal-start-respecting rule.It is known that GCAI for both rules are NP-hard,but whether they are fixed-parameter tractable with respect to the number of distinguished individuals remained open.We resolve both open problems in the affirmative.In addition,we strengthen the NP-hardness of GCAI by showing that,with respect to the natural parameter the number of added individuals,GCAI for both rules are W[2]-hard.Notably,the W[2]-hardness for the liberal-start-respecting rule holds even when restricted to a very special case where the qualifications of individuals satisfy the so-called consecutive ones property.However,for the consensus-start-respecting rule,the problem becomes polynomial-time solvable in this special case.We also study a dual restriction where the disqualifications of individuals fulfill the consecutive ones property,and show that under this restriction GCAI for both rules turn out to be polynomial-time solvable.Our reductions for showing W[2]-hardness also imply several algorithmic lower bounds.