Zhou et al., 2019
ViewPDF| Publication | Publication Date | Title |
|---|---|---|
| Crankshaw et al. | InferLine: latency-aware provisioning and scaling for prediction serving pipelines | |
| CN105956021B (en) | A kind of automation task suitable for distributed machines study parallel method and its system | |
| Jiang et al. | Heterogeneity-aware distributed parameter servers | |
| Gu et al. | SHadoop: Improving MapReduce performance by optimizing job execution mechanism in Hadoop clusters | |
| Zhang et al. | An adaptive synchronous parallel strategy for distributed machine learning | |
| CN111079921A (en) | Efficient neural network training and scheduling method based on heterogeneous distributed system | |
| Xie et al. | Elan: Towards generic and efficient elastic training for deep learning | |
| CN110705716A (en) | Multi-model parallel training method | |
| Geng et al. | Interference-aware parallelization for deep learning workload in GPU cluster | |
| Zhou et al. | Scheduling-efficient framework for neural network on heterogeneous distributed systems and mobile edge computing systems | |
| Shu et al. | Resource demand prediction of cloud workloads using an attention-based GRU model | |
| Cao et al. | SAP-SGD: Accelerating distributed parallel training with high communication efficiency on heterogeneous clusters | |
| Yu et al. | Accelerating distributed training in heterogeneous clusters via a straggler-aware parameter server | |
| Fan et al. | Model aggregation method for data parallelism in distributed real-time machine learning of smart sensing equipment | |
| Wang et al. | OrientStream: A framework for dynamic resource allocation in distributed data stream management systems | |
| Adefemi | What Every Computer Scientist Needs To Know About Parallelization | |
| Yang et al. | Parameter communication consistency model for large-scale security monitoring based on mobile computing | |
| Li et al. | Scheduling distributed deep learning jobs in heterogeneous cluster with placement awareness | |
| Brown et al. | An Experiment in Knowledge-Based Signal Understanding Using Parallel Architectures. | |
| Ji et al. | EP4DDL: addressing straggler problem in heterogeneous distributed deep learning. | |
| Koch et al. | SMiPE: estimating the progress of recurring iterative distributed dataflows | |
| Yao et al. | LBB: load-balanced batching for efficient distributed learning on heterogeneous GPU cluster: F. Yao et al. | |
| Li et al. | Optimizing machine learning on apache spark in HPC environments | |
| Wang et al. | SingleCaffe: an efficient framework for deep learning on a single node | |
| Nylander et al. | Towards performance modeling of speculative execution for cloud applications |