Al-Sayeh et al., 2020
ViewHTML| Publication | Publication Date | Title |
|---|---|---|
| Al-Sayeh et al. | A gray-box modeling methodology for runtime prediction of apache spark jobs | |
| García-Gil et al. | A comparison on scalability for batch big data processing on Apache Spark and Apache Flink | |
| Van Aken et al. | Automatic database management system tuning through large-scale machine learning | |
| EP2811792B1 (en) | A method for operating a mobile telecommunication device | |
| CN113227998A (en) | Technology for comprehensively supporting autonomous JSON document object (AJD) cloud service | |
| Eltabakh et al. | Eagle-eyed elephant: split-oriented indexing in Hadoop | |
| Costa et al. | Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems | |
| Stadler et al. | Sparklify: A scalable software component for efficient evaluation of sparql queries over distributed rdf datasets | |
| US11301469B2 (en) | Dynamic rebuilding of query execution trees and reselection of query execution operators | |
| Chen et al. | Distributed and scalable sequential pattern mining through stream processing | |
| Michiardi et al. | Cache-based multi-query optimization for data-intensive scalable computing frameworks | |
| Gates et al. | Apache Pig's Optimizer. | |
| Hewasinghage et al. | A cost model for random access queries in document stores | |
| Dziedzic et al. | DBMS data loading: An analysis on modern hardware | |
| Boncz et al. | Advances in large-scale RDF data management | |
| Bellatreche | Optimization and tuning in data warehouses | |
| Munir et al. | A cost-based storage format selector for materialized results in big data frameworks | |
| Tinnefeld et al. | Elastic online analytical processing on ramcloud | |
| Shi et al. | Performance models of data parallel DAG workflows for large scale data analytics | |
| Arora | Improving performance of data science applications in python | |
| Sejdiu et al. | DistLODStats: Distributed computation of RDF dataset statistics | |
| Chao-Qiang et al. | RDDShare: reusing results of spark RDD | |
| Hagedorn et al. | Cost-based sharing and recycling of (intermediate) results in dataflow programs | |
| Ezzati‐Jivan et al. | Cube data model for multilevel statistics computation of live execution traces | |
| Hartig et al. | A Main Memory Index Structure to Query Linked Data. |