A Peta-Scale Data Movement and Analysis in Data Warehouse (APSDMADW)
📄 Content
International Journal of Computer Applications (0975 – 8887), Volume 151 – No. 7, October 2016
A Peta-Scale Data Movement and Analysis in Data Warehouse (APSDMADW)
Ahmed Mateen
Department of Computer Science,
University of Agriculture Faisalabad, Pakistan
Lareab Chaudhary
Department of Computer Science,
University of Agriculture Faisalabad, Pakistan
ABSTRACT
Data warehousing and on-line analytical processing (OLAP) are essential elements of decision support, which has increasingly become a focus of the database industry. This paper provides an overview of data warehousing and OLAP technologies, with an emphasis on their new requirements. We describe back-end tools for extracting, cleaning, and loading data into a data warehouse; the multidimensional data models typical of OLAP; front-end client tools for querying and data analysis; server extensions for efficient query processing; and tools for metadata management and for managing the warehouse. Insights based on comprehensive data about customer behavior, product performance, and market performance are powerful drivers of growth and competition in the internet space. This research examines the business motivation behind, and the design and performance of, servers operating on a data warehouse at petabyte scale, applying some new techniques to obtain better and more efficient results. The tests show the data-drop rate in the data warehouse. The engine has been in production at Yahoo! since 2007 and currently manages more than half a dozen petabytes of data.
Keywords
Data warehouse, OLAP, analytical, MapReduce, volume, MOPS, VO, CEDPS, function, processing
1. INTRODUCTION
The quantity of data generated every day across the globe is exploding, and the growing reach of digital and social media and the internet of things is fueling it further. The rate of data growth is astounding, and this data arrives with velocity and variety (not necessarily structured) and contains a wealth of information that can be the key to gaining an edge in a competitive business [1]. The motivation to create MapReduce came from Google's need to process huge amounts of records across a network of computers. To do this effectively, the solution had to handle the scheduling details itself while letting the user write only the application code for a given task [11]. Google's MapReduce is implemented in the C++ programming language. It takes a set of input records and applies a map function to each of them. The map function is defined by the programmer and outputs a list of intermediate records, which become the input to the reduce function.
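To make the map/reduce contract concrete, here is a minimal, self-contained Python sketch of the model just described, using word count as the illustrative job (the function names and sample data are ours, not from the paper or from Google's C++ implementation):

```python
from collections import defaultdict

# Map: one input record -> list of intermediate (key, value) records.
def map_fn(record):
    return [(word, 1) for word in record.split()]

# Reduce: one key plus all its intermediate values -> final value.
def reduce_fn(key, values):
    return sum(values)

def run_mapreduce(records, map_fn, reduce_fn):
    # Shuffle phase: group intermediate records by key.
    groups = defaultdict(list)
    for record in records:
        for key, value in map_fn(record):
            groups[key].append(value)
    # Reduce phase: fold each group down to one output record.
    return {key: reduce_fn(key, values) for key, values in groups.items()}

if __name__ == "__main__":
    logs = ["warehouse query olap", "olap query"]
    print(run_mapreduce(logs, map_fn, reduce_fn))
    # {'warehouse': 1, 'query': 2, 'olap': 2}
```

In a real deployment the shuffle and reduce phases run on many servers, which is exactly what the scheduling middleware described next takes care of.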
1.1 Scheduling
The servers that execute the processing defined by the map and reduce functions are selected automatically by the middleware, which places tasks close to the data they read; this reduces the volume of data transfers and enables efficient processing. Synchronized processing: data transfers between servers for the map and reduce functions are synchronized. Fault tolerance: to ensure that processing can continue even when several servers have failed, data backups and intermediate processing results are stored automatically [2].
2. PREVIOUS WORK
In earlier schemes, data movement carried no authentication, and data was lost. Data handling was not efficient: much time was wasted in sending and receiving data. On the servers, data sat in a queue and prioritized data was sent first, so low-priority data was sometimes wasted and never reached its destination (a toy illustration of this drop behavior follows). Newer approaches are therefore used to reduce data wastage and deliver efficient performance: they lose less data and ensure that data reliably reaches its destination. They provide high speed, tera-scale analysis, troubleshooting of the intricate end-to-end framework, construction and operation of versatile services, and security of the end-to-end framework.
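The following toy Python sketch (entirely our own; the queue capacity and record names are invented) illustrates the failure mode described above: a bounded queue that favors high-priority records silently drops low-priority ones.

```python
import heapq

def send_with_priorities(records, capacity=3):
    # records: (priority, payload) pairs; a lower number means higher priority.
    # Models a bounded server queue: on overflow, the lowest-priority record
    # is evicted, so low-priority data may never reach its destination.
    queue, dropped = [], []
    for prio, payload in records:
        heapq.heappush(queue, (prio, payload))
        if len(queue) > capacity:
            worst = max(queue)        # numerically largest = lowest priority
            queue.remove(worst)
            heapq.heapify(queue)      # restore the heap invariant
            dropped.append(worst[1])
    sent = [heapq.heappop(queue)[1] for _ in range(len(queue))]
    return sent, dropped

if __name__ == "__main__":
    traffic = [(2, "clickstream-1"), (0, "billing"), (2, "clickstream-2"),
               (0, "alerts"), (2, "clickstream-3")]
    sent, lost = send_with_priorities(traffic)
    print("sent:", sent)   # high-priority records are delivered first
    print("lost:", lost)   # low-priority records are silently dropped
```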
3. MATERIAL AND METHODS
3.1 Proposed approach
• High-speed, reliable data movement, to transfer data from its site of creation to different locations for subsequent analysis (see the sketch after this list).
• Tera-scale or faster local data analysis, to enable investigation of data that has been brought in locally [3].
• High-performance visualization, to enable scrutiny of chosen subsets and features of substantial datasets prior to download.
• Troubleshooting of the intricate end-to-end framework, which, because of its many hardware and software components, can fail in an extensive variety of frequently difficult-to-diagnose ways [4].
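The paper gives no code for the high-speed movement step. As a rough sketch of the idea in the first bullet, one might chunk a dataset, move the chunks in parallel, and verify each chunk on arrival so nothing is lost silently; all names and sizes below are our assumptions:

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor

CHUNK_SIZE = 4 * 1024 * 1024  # 4 MiB chunks (illustrative choice)

def chunk(data: bytes):
    for off in range(0, len(data), CHUNK_SIZE):
        yield off, data[off:off + CHUNK_SIZE]

def send_chunk(off, payload, destination):
    # Stand-in for the real network send; returns a receipt digest the
    # destination would compute, letting the sender verify integrity.
    destination[off] = payload
    return off, hashlib.sha256(payload).hexdigest()

def transfer(data: bytes, destination: dict) -> bool:
    # Move chunks in parallel, then check every checksum on arrival,
    # so dropped or corrupted chunks are detected rather than lost silently.
    with ThreadPoolExecutor(max_workers=8) as pool:
        receipts = list(pool.map(lambda c: send_chunk(*c, destination),
                                 chunk(data)))
    return all(hashlib.sha256(data[off:off + CHUNK_SIZE]).hexdigest() == digest
               for off, digest in receipts)

if __name__ == "__main__":
    dataset = b"x" * (10 * CHUNK_SIZE + 123)
    received = {}
    print("verified:", transfer(dataset, received))  # True
```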
Several new techniques were reviewed to obtain better results; for example, SSD-based staging can be sized with the following formulas.
\[ S_{capacity} = \frac{C_{SSD}}{d_{node} \cdot (\lambda_a + num_{chkpts} \cdot \lambda_c) \cdot t_{iter}} \]

where \(\lambda_a\) is the per-node analysis output rate, \(\lambda_c\) the checkpoint output rate, \(num_{chkpts}\) the number of checkpoints per iteration, \(t_{iter}\) the iteration time, and \(d_{node}\) a node's share of the data; \(S_{capacity}\) is then the number of iterations of output the SSD can absorb.
The SSD drive itself will be picked in view of numerous factors, for example IOPS/$, GB/$, BW/$, write endurance, and its capacity \(C_{SSD}\).
\[ S_{bandwidth} = \frac{N \cdot BW_{PFS}}{BW_{host2ssd}} \]

where \(N\) is the number of nodes, \(BW_{PFS}\) the required per-node bandwidth to the parallel file system, and \(BW_{host2ssd}\) the host-to-SSD bandwidth of a single drive; \(S_{bandwidth}\) is the number of drives needed to match that aggregate demand.
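As a worked example under our reading of the two (partly garbled) sizing formulas above, the following Python snippet plugs in invented numbers; none of the values come from the paper:

```python
# Illustrative provisioning arithmetic for the two sizing formulas above.
# All numbers are invented; the formula reading follows our reconstruction.

C_SSD = 400e9          # SSD capacity per node, bytes
d_node = 1.0           # node's share of the dataset (dimensionless)
lam_a = 50e6           # analysis output rate, bytes/s
lam_c = 200e6          # checkpoint output rate, bytes/s
num_chkpts = 2         # checkpoints per iteration
t_iter = 600.0         # iteration length, seconds

# Iterations of output one node's SSD can absorb before filling up.
s_capacity = C_SSD / (d_node * (lam_a + num_chkpts * lam_c) * t_iter)
print(f"S_capacity  = {s_capacity:.1f} iterations of staging headroom")

N = 1000               # number of nodes
BW_PFS = 1e9           # required per-node bandwidth to the PFS, bytes/s
BW_host2ssd = 2e9      # host-to-SSD bandwidth per drive, bytes/s

# Drives needed so aggregate SSD bandwidth matches aggregate PFS demand.
s_bandwidth = N * BW_PFS / BW_host2ssd
print(f"S_bandwidth = {s_bandwidth:.0f} drives")
```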