{"product_id":"foundations-of-data-intensive-applications-large-scale-data-analytics-under-the-hood-paperback","title":"Foundations of Data Intensive Applications: Large Scale Data Analytics Under the Hood - Paperback","description":"\u003cp\u003eby \u003cb\u003eSupun Kamburugamuve\u003c\/b\u003e (Author), \u003cb\u003eSaliya Ekanayake\u003c\/b\u003e (Author)\u003c\/p\u003e\u003cp\u003e\u003c\/p\u003e\u003cp\u003eThere is an ever increasing need to store this data, process them and incorporate the knowledge into everyday business operations of the companies. Before big data systems. there were high performance systems designed to do large calculations. Around the time big data became popular, high performance computing systems were mature enough to support the scientific community. But they weren't ready for the enterprise needs of data analytics. Because of the lack of system support for big data systems at that time, there was a large number of systems created to store and process data. These systems were created according to different design principles and some of them thrived through the years while some didn't succeed. Because of the diverse nature of systems and tools available for data analytics, there is a need to understand these systems and their applications from a theoretical perspective. These systems are masking the user from underlying details, and they use them without knowing how they work. This works for simple applications but when developing more complex applications that need to scale, users find themselves without the required foundational knowledge to reason about the issues. This knowledge is currently hidden in the systems and research papers.\u003c\/p\u003e \u003cp\u003e The underlying principles behind data processing systems originate from the parallel and distributed computing paradigms. Among the many systems and APIs for data processing, they use the same fundamental ideas under the hood with slightly different variations. We can breakdown data analytics systems according to these principles and study them to understand the inner workings of applications.\u003cbr\u003e\u003cbr\u003e This book defines these foundational components of large scale, distributed data processing systems and go into details independently of specific frameworks. It draws examples of current systems to explain how these principles are used in practice. Major design decisions around these foundational components define the performance, type of applications supported and usability. One of the goals of the book is to explain these differences so that readers can take informed decisions when developing applications. Further it will help readers to acquire in-depth knowledge and recognize problems in their applications such as performance issues, distributed operation issues, and fault tolerance aspects.\u003c\/p\u003e \u003cp\u003eThis book aims to use state of the art research when appropriate to discuss some ideas and future of data analytics tools.\u003c\/p\u003e\u003ch3\u003eFront Jacket\u003c\/h3\u003e\u003cp\u003e\u003c\/p\u003e\u003cp\u003e\u003cb\u003ePEEK \"UNDER THE HOOD\" OF BIG DATA ANALYTICS\u003c\/b\u003e\u003c\/p\u003e \u003cp\u003eThe world of big data analytics grows ever more complex. And while many people can work superficially with specific frameworks, far fewer understand the fundamental principles of large-scale, distributed data processing systems and how they operate. In Foundations of Data Intensive Applications: Large Scale Data Analytics under the Hood, renowned big-data experts and computer scientists Drs. Supun Kamburugamuve and Saliya Ekanayake deliver a practical guide to applying the principles of big data to software development for optimal performance. \u003c\/p\u003e\u003cp\u003eThe authors discuss foundational components of large-scale data systems and walk readers through the major software design decisions that define performance, application type, and usability. You ll learn how to recognize problems in your applications resulting in performance and distributed operation issues, diagnose them, and effectively eliminate them by relying on the bedrock big data principles explained within. \u003c\/p\u003e\u003cp\u003eMoving beyond individual frameworks and APIs for data processing, this book unlocks the theoretical ideas that operate under the hood of every big data processing system. \u003c\/p\u003e\u003cp\u003eIdeal for data scientists, data architects, dev-ops engineers, and developers, \u003ci\u003eFoundations of Data Intensive Applications: Large Scale Data Analytics under the Hood\u003c\/i\u003e shows readers how to: \u003c\/p\u003e\u003cul\u003e\u003cb\u003e\u003cli\u003eIdentify the foundations of large-scale, distributed data processing systems\u003c\/li\u003e \u003cli\u003eMake major software design decisions that optimize performance\u003c\/li\u003e \u003cli\u003eDiagnose performance problems and distributed operation issues\u003c\/li\u003e \u003cli\u003eUnderstand state-of-the-art research in big data\u003c\/li\u003e \u003cli\u003eExplain and use the major big data frameworks and understand what underpins them\u003c\/li\u003e \u003cli\u003eUse big data analytics in the real world to solve practical problems\u003c\/li\u003e\u003c\/b\u003e\u003c\/ul\u003e\u003ch3\u003eAuthor Biography\u003c\/h3\u003e\u003cp\u003e\u003c\/p\u003e\u003cp\u003e\u003cb\u003eSUPUN KAMBURUGAMUVE, PhD, \u003c\/b\u003e is a computer scientist researching and designing large scale data analytics tools. He received his doctorate in Computer Science from Indiana University, Bloomington and architected the data processing systems Twister2 and Cylon.\u003c\/p\u003e \u003cp\u003e\u003cb\u003eSALIYA EKANAYAKE, PhD, \u003c\/b\u003e is a Senior Software Engineer at Microsoft working in the intersection of scaling deep learning systems and parallel computing. He is also a research affiliate at Berkeley Lab. He received his doctorate in Computer Science from Indiana University, Bloomington.\u003c\/p\u003e\u003cdiv\u003e\n\u003cstrong\u003eNumber of Pages:\u003c\/strong\u003e 416\u003c\/div\u003e\u003cdiv\u003e\n\u003cstrong\u003eDimensions:\u003c\/strong\u003e 0.9 x 9.1 x 7.4 IN\u003c\/div\u003e\u003cdiv\u003e\n\u003cstrong\u003ePublication Date:\u003c\/strong\u003e September 08, 2021\u003c\/div\u003e","brand":"Books by splitShops","offers":[{"title":"Default Title","offer_id":51759251259680,"sku":"9781119713029","price":55.0,"currency_code":"USD","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0974\/9764\/5344\/files\/bfb1f026461ed74b011f7712f4441a0f.webp?v=1780146822","url":"https:\/\/ebocreations.com\/products\/foundations-of-data-intensive-applications-large-scale-data-analytics-under-the-hood-paperback","provider":"The E-Book Oasis LLC","version":"1.0","type":"link"}