Data-Intensive Workload Consolidation on Hadoop Distributed File System

Reading time: 3 minutes

📝 Original Info

  • Title: Data-Intensive Workload Consolidation on Hadoop Distributed File System
  • ArXiv ID: 1303.7270
  • Date: 2016-11-15
  • Authors: Researchers from original ArXiv paper

📝 Abstract

Workload consolidation, sharing physical resources among multiple workloads, is a promising technique for saving cost and energy in cluster computing systems. This paper highlights several challenges of workload consolidation for Hadoop, one of the current state-of-the-art data-intensive cluster computing systems. Through a systematic, step-by-step procedure, we investigate the challenges of efficient server consolidation in Hadoop environments. To this end, we first investigate the relationship between last-level cache (LLC) contention and throughput degradation for consolidated workloads on a single physical server running the Hadoop Distributed File System (HDFS). We then investigate the general case of consolidation on multiple physical servers so that their throughput never falls below a desired, predefined utilization level. We use our empirical results to model consolidation as a classic two-dimensional bin-packing problem and then design a computationally efficient greedy algorithm that minimizes throughput degradation across multiple servers. Results are promising and show that our greedy approach achieves a near-optimal solution in all tested cases.
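
The paper casts consolidation as a two-dimensional bin-packing problem solved by a greedy algorithm. As a rough illustration of that idea only, below is a minimal first-fit-decreasing sketch in Python, assuming each workload is summarized by two normalized demands (CPU utilization and LLC pressure). The dimension names, capacities, and demo workloads are hypothetical; the paper's exact model and greedy heuristic are not reproduced here.

```python
# Minimal sketch: consolidation as 2D bin packing via first-fit-decreasing.
# ASSUMPTION: each workload has two normalized demands (cpu, llc) in [0, 1];
# this is illustrative and not the authors' exact model or algorithm.

from dataclasses import dataclass, field
from typing import List

@dataclass
class Workload:
    name: str
    cpu: float  # normalized CPU demand, 0..1 (hypothetical unit)
    llc: float  # normalized LLC pressure, 0..1 (hypothetical unit)

@dataclass
class Server:
    cpu_capacity: float = 1.0
    llc_capacity: float = 1.0
    workloads: List[Workload] = field(default_factory=list)

    def fits(self, w: Workload) -> bool:
        # A workload fits if neither dimension would exceed capacity.
        used_cpu = sum(x.cpu for x in self.workloads)
        used_llc = sum(x.llc for x in self.workloads)
        return (used_cpu + w.cpu <= self.cpu_capacity
                and used_llc + w.llc <= self.llc_capacity)

def consolidate(workloads: List[Workload]) -> List[Server]:
    """Greedy first-fit-decreasing on the larger of the two demands."""
    servers: List[Server] = []
    # Place "big" items first; open a new server only when nothing fits.
    for w in sorted(workloads, key=lambda x: max(x.cpu, x.llc), reverse=True):
        target = next((s for s in servers if s.fits(w)), None)
        if target is None:
            target = Server()
            servers.append(target)
        target.workloads.append(w)
    return servers

if __name__ == "__main__":
    demo = [Workload("sort", 0.5, 0.6), Workload("grep", 0.3, 0.2),
            Workload("wordcount", 0.4, 0.5), Workload("join", 0.2, 0.3)]
    for i, s in enumerate(consolidate(demo)):
        print(f"server {i}: {[w.name for w in s.workloads]}")
```

First-fit-decreasing is a standard greedy heuristic for bin packing; the authors' own greedy may order and place workloads differently to meet the throughput-degradation constraint described above.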

Reference

This content is AI-processed based on ArXiv data.
