Browse Results

Showing 28,976 through 29,000 of 37,951 results

Data Management, Analytics and Innovation: Proceedings of ICDMAI 2019, Volume 1 (Advances in Intelligent Systems and Computing #1042)

by Valentina Emilia Balas Amlan Chakrabarti Neha Sharma

This book presents the latest findings in the areas of data management and smart computing, big data management, artificial intelligence and data analytics, along with advances in network technologies. It addresses state-of-the-art topics and discusses challenges and solutions for future development. Gathering original, unpublished contributions by scientists from around the globe, the book is mainly intended for a professional audience of researchers and practitioners in academia and industry.

Data Management, Analytics and Innovation: Proceedings of ICDMAI 2020, Volume 1 (Advances in Intelligent Systems and Computing #1174)

by Valentina Emilia Balas Amlan Chakrabarti Neha Sharma Jan Martinovic

This book presents the latest findings in the areas of data management and smart computing, big data management, artificial intelligence and data analytics, along with advances in network technologies. Gathering peer-reviewed research papers presented at the Fourth International Conference on Data Management, Analytics and Innovation (ICDMAI 2020), held on 17–19 January 2020 at the United Services Institute (USI), New Delhi, India, it addresses cutting-edge topics and discusses challenges and solutions for future development. Featuring original, unpublished contributions by respected experts from around the globe, the book is mainly intended for a professional audience of researchers and practitioners in academia and industry.

Data Management, Analytics and Innovation: Proceedings of ICDMAI 2020, Volume 2 (Advances in Intelligent Systems and Computing #1175)

by Valentina Emilia Balas Amlan Chakrabarti Neha Sharma Jan Martinovic

This book presents the latest findings in the areas of data management and smart computing, big data management, artificial intelligence and data analytics, along with advances in network technologies. Gathering peer-reviewed research papers presented at the Fourth International Conference on Data Management, Analytics and Innovation (ICDMAI 2020), held on 17–19 January 2020 at the United Services Institute (USI), New Delhi, India, it addresses cutting-edge topics and discusses challenges and solutions for future development. Featuring original, unpublished contributions by respected experts from around the globe, the book is mainly intended for a professional audience of researchers and practitioners in academia and industry.

Data Made Flesh: Embodying Information

by Robert Mitchell Phillip Thurtle

In an age of cloning, cyborgs, and biotechnology, the line between bodies and bytes seems to be disappearing. Data Made Flesh is the first collection to address the increasingly important links between information and embodiment, at a moment when we are routinely tempted, in the words of Donna Haraway, "to be raptured out of the bodies that matter in the lust for information," whether in the rush to complete the Human Genome Project or in the race to clone a human being.

Data Love: The Seduction and Betrayal of Digital Technologies

by Roberto Simanowski

Intelligence services, government administrations, businesses, and a growing majority of the population are hooked on the idea that big data can reveal patterns and correlations in everyday life. Initiated by software engineers and carried out through algorithms, the mining of big data has sparked a silent revolution. But algorithmic analysis and data mining are not simply byproducts of media development or the logical consequences of computation. They are the radicalization of the Enlightenment's quest for knowledge and progress. Data Love argues that the "cold civil war" of big data is taking place not among citizens or between the citizen and government but within each of us.Roberto Simanowski elaborates on the changes data love has brought to the human condition while exploring the entanglements of those who—out of stinginess, convenience, ignorance, narcissism, or passion—contribute to the amassing of ever more data about their lives, leading to the statistical evaluation and individual profiling of their selves. Writing from a philosophical standpoint, Simanowski illustrates the social implications of technological development and retrieves the concepts, events, and cultural artifacts of past centuries to help decode the programming of our present.

Data Lakes

by Dominique Laurent Anne Laurent Cédrine Madera

The concept of a data lake is less than 10 years old, but they are already hugely implemented within large companies. Their goal is to efficiently deal with ever-growing volumes of heterogeneous data, while also facing various sophisticated user needs. However, defining and building a data lake is still a challenge, as no consensus has been reached so far. Data Lakes presents recent outcomes and trends in the field of data repositories. The main topics discussed are the data-driven architecture of a data lake; the management of metadata – supplying key information about the stored data, master data and reference data; the roles of linked data and fog computing in a data lake ecosystem; and how gravity principles apply in the context of data lakes. A variety of case studies are also presented, thus providing the reader with practical examples of data lake management.

Data Lake for Enterprises

by Pankaj Misra Tomcy John

A practical guide to implementing your enterprise data lake using Lambda Architecture as the base About This Book • Build a full-fledged data lake for your organization with popular big data technologies using the Lambda architecture as the base • Delve into the big data technologies required to meet modern day business strategies • A highly practical guide to implementing enterprise data lakes with lots of examples and real-world use-cases Who This Book Is For Java developers and architects who would like to implement a data lake for their enterprise will find this book useful. If you want to get hands-on experience with the Lambda Architecture and big data technologies by implementing a practical solution using these technologies, this book will also help you. What You Will Learn • Build an enterprise-level data lake using the relevant big data technologies • Understand the core of the Lambda architecture and how to apply it in an enterprise • Learn the technical details around Sqoop and its functionalities • Integrate Kafka with Hadoop components to acquire enterprise data • Use flume with streaming technologies for stream-based processing • Understand stream- based processing with reference to Apache Spark Streaming • Incorporate Hadoop components and know the advantages they provide for enterprise data lakes • Build fast, streaming, and high-performance applications using ElasticSearch • Make your data ingestion process consistent across various data formats with configurability • Process your data to derive intelligence using machine learning algorithms In Detail The term "Data Lake" has recently emerged as a prominent term in the big data industry. Data scientists can make use of it in deriving meaningful insights that can be used by businesses to redefine or transform the way they operate. Lambda architecture is also emerging as one of the very eminent patterns in the big data landscape, as it not only helps to derive useful information from historical data but also correlates real-time data to enable business to take critical decisions. This book tries to bring these two important aspects — data lake and lambda architecture—together. This book is divided into three main sections. The first introduces you to the concept of data lakes, the importance of data lakes in enterprises, and getting you up-to-speed with the Lambda architecture. The second section delves into the principal components of building a data lake using the Lambda architecture. It introduces you to popular big data technologies such as Apache Hadoop, Spark, Sqoop, Flume, and ElasticSearch. The third section is a highly practical demonstration of putting it all together, and shows you how an enterprise data lake can be implemented, along with several real-world use-cases. It also shows you how other peripheral components can be added to the lake to make it more efficient. By the end of this book, you will be able to choose the right big data technologies using the lambda architectural patterns to build your enterprise data lake. Style and approach The book takes a pragmatic approach, showing ways to leverage big data technologies and lambda architecture to build an enterprise-level data lake.

Data Lake Development with Big Data

by Pradeep Pasupuleti Beulah Salome Purra

Explore architectural approaches to building Data Lakes that ingest, index, manage, and analyze massive amounts of data using Big Data technologies About This Book * Comprehend the intricacies of architecting a Data Lake and build a data strategy around your current data architecture * Efficiently manage vast amounts of data and deliver it to multiple applications and systems with a high degree of performance and scalability * Packed with industry best practices and use-case scenarios to get you up-and-running Who This Book Is For This book is for architects and senior managers who are responsible for building a strategy around their current data architecture, helping them identify the need for a Data Lake implementation in an enterprise context. The reader will need a good knowledge of master data management, information lifecycle management, data governance, data product design, data engineering, and systems architecture. Also required is experience of Big Data technologies such as Hadoop, Spark, Splunk, and Storm. What You Will Learn * Identify the need for a Data Lake in your enterprise context and learn to architect a Data Lake * Learn to build various tiers of a Data Lake, such as data intake, management, consumption, and governance, with a focus on practical implementation scenarios * Find out the key considerations to be taken into account while building each tier of the Data Lake * Understand Hadoop-oriented data transfer mechanism to ingest data in batch, micro-batch, and real-time modes * Explore various data integration needs and learn how to perform data enrichment and data transformations using Big Data technologies * Enable data discovery on the Data Lake to allow users to discover the data * Discover how data is packaged and provisioned for consumption * Comprehend the importance of including data governance disciplines while building a Data Lake In Detail A Data Lake is a highly scalable platform for storing huge volumes of multistructured data from disparate sources with centralized data management services. It eliminates the need for up-front modeling and rigid data structures by allowing schema-less writes. Data Lakes make it possible to ask complex far-reaching questions to find out hidden data patterns and relationships. This book explores the potential of Data Lakes and explores architectural approaches to building data lakes that ingest, index, manage, and analyze massive amounts of data using batch and real-time processing frameworks. It guides you on how to go about building a Data Lake that is managed by Hadoop and accessed as required by other Big Data applications such as Spark, Storm, Hive, and so on, to create an environment in which data from different sources can be meaningfully brought together and analyzed. Data Lakes can be viewed as having three capabilities--intake, management, and consumption. This book will take readers through each of these processes of developing a Data Lake and guide them (using best practices) in developing these capabilities. It will also explore often ignored, yet crucial considerations while building Data Lakes, with the focus on how to architect data governance, security, data quality, data lineage tracking, metadata management, and semantic data tagging. By the end of this book, you will have a good understanding of building a Data Lake for Big Data. You will be able to utilize Data Lakes for efficient and easy data processing and analytics. Style and approach Data Lake Development with Big Data provides architectural approaches to building a Data Lake. It follows a use case-based approach where practical implementation scenarios of each key component are explained. It also helps you understand how these use cases are implemented in a Data Lake. The chapters are organized in a way that mimics the sequential data flow evidenced in a Data Lake.

Data Lake Analytics on Microsoft Azure: A Practitioner's Guide to Big Data Engineering

by Harsh Chawla Pankaj Khattar

Get a 360-degree view of how the journey of data analytics solutions has evolved from monolithic data stores and enterprise data warehouses to data lakes and modern data warehouses. You willThis book includes comprehensive coverage of how:To architect data lake analytics solutions by choosing suitable technologies available on Microsoft AzureThe advent of microservices applications covering ecommerce or modern solutions built on IoT and how real-time streaming data has completely disrupted this ecosystemThese data analytics solutions have been transformed from solely understanding the trends from historical data to building predictions by infusing machine learning technologies into the solutionsData platform professionals who have been working on relational data stores, non-relational data stores, and big data technologies will find the content in this book useful. The book also can help you start your journey into the data engineer world as it provides an overview of advanced data analytics and touches on data science concepts and various artificial intelligence and machine learning technologies available on Microsoft Azure.What Will You LearnYou will understand the:Concepts of data lake analytics, the modern data warehouse, and advanced data analyticsArchitecture patterns of the modern data warehouse and advanced data analytics solutionsPhases—such as Data Ingestion, Store, Prep and Train, and Model and Serve—of data analytics solutions and technology choices available on Azure under each phaseIn-depth coverage of real-time and batch mode data analytics solutions architectureVarious managed services available on Azure such as Synapse analytics, event hubs, Stream analytics, CosmosDB, and managed Hadoop services such as Databricks and HDInsightWho This Book Is ForData platform professionals, database architects, engineers, and solution architects

Data Jujitsu: The Art of Turning Data into Product

by D. J. Patil

Acclaimed data scientist DJ Patil details a new approach to solving problems in Data Jujitsu.Learn how to use a problem's "weight" against itself to: Break down seemingly complex data problems into simplified parts; Use alternative data analysis techniques to examine them; Use human input, such as Mechanical Turk, and design tricks that enlist the help of your users to take short cuts around tough problems; Learn more about the problems before starting on the solutions--and use the findings to solve them, or determine whether the problems are worth solving at all.

The Data Journalism Handbook

by Jonathan Gray Lucy Chambers Liliana Bounegru

<p>When you combine the sheer scale and range of digital information now available with a journalist&#8217;s "nose for news" and her ability to tell a compelling story, a new world of possibility opens up. With <i>The Data Journalism Handbook</i>, you&#8217;ll explore the potential, limits, and applied uses of this new and fascinating field.</p>


by Steve Lohr

By one estimate, 90 percent of all of the data in history was created in the last two years. In 2014, International Data Corporation calculated the data universe at 4.4 zettabytes, or 4.4 trillion gigabytes. That much information, in volume, could fill enough slender iPad Air tablets to create a stack two-thirds of the way to the moon. Now, that's Big Data.Coal, iron ore, and oil were the key productive assets that fueled the Industrial Revolution. The vital raw material of today's information economy is data.In Data-ism, New York Times reporter Steve Lohr explains how big-data technology is ushering in a revolution in proportions that promise to be the basis of the next wave of efficiency and innovation across the economy. But more is at work here than technology. Big data is also the vehicle for a point of view, or philosophy, about how decisions will be--and perhaps should be--made in the future. Lohr investigates the benefits of data while also examining its dark side. Data-ism is about this next phase, in which vast Internet-scale data sets are used for discovery and prediction in virtually every field. It shows how this new revolution will change decision making--by relying more on data and analysis, and less on intuition and experience--and transform the nature of leadership and management. Focusing on young entrepreneurs at the forefront of data science as well as on giant companies such as IBM that are making big bets on data science for the future of their businesses, Data-ism is a field guide to what is ahead, explaining how individuals and institutions will need to exploit, protect, and manage data to stay competitive in the coming years. With rich examples of how the rise of big data is affecting everyday life, Data-ism also raises provocative questions about policy and practice that have wide implications for everyone.The age of data-ism is here. But are we ready to handle its consequences, good and bad?

Data-ism: Inside the Big Data Revolution

by Steve Lohr

Coal, iron ore and oil were the fuel of the Industrial Revolution. Today's economies and governments are powered by something far less tangible: the explosive abundance of digital data.Steve Lohr, the New York Times' chief technology reporter, charts the ascent of Data-ism, the dominating philosophy of the day in which data is at the forefront of everything and decisions of all kinds are based on data analysis rather than experience and intuition. Taking us behind the scenes and introducing the DOPs (Data Oriented-People), the key personalities behind this revolution, he reveals how consuming the bits and bytes of the masses is transforming the nature of business and governance in unforeseen ways. But what are losing in the process and what new dangers await?

Data-Intensive Science (Chapman And Hall/crc Computational Science Ser. #18)

by Terence Critchlow Kerstin Kleese Van Dam

Data-intensive science has the potential to transform scientific research and quickly translate scientific progress into complete solutions, policies, and economic success. But this collaborative science is still lacking the effective access and exchange of knowledge among scientists, researchers, and policy makers across a range of disciplines. Bringing together leaders from multiple scientific disciplines, Data-Intensive Science shows how a comprehensive integration of various techniques and technological advances can effectively harness the vast amount of data being generated and significantly accelerate scientific progress to address some of the world's most challenging problems. In the book, a diverse cross-section of application, computer, and data scientists explores the impact of data-intensive science on current research and describes emerging technologies that will enable future scientific breakthroughs. The book identifies best practices used to tackle challenges facing data-intensive science as well as gaps in these approaches. It also focuses on the integration of data-intensive science into standard research practice, explaining how components in the data-intensive science environment need to work together to provide the necessary infrastructure for community-scale scientific collaborations. Organizing the material based on a high-level, data-intensive science workflow, this book provides an understanding of the scientific problems that would benefit from collaborative research, the current capabilities of data-intensive science, and the solutions to enable the next round of scientific advancements.

Data Intensive Computing for Biodiversity (Studies in Computational Intelligence #485)

by Sarinder K. Dhillon Amandeep S. Sidhu

This book is focused on the development of a data integration framework for retrieval of biodiversity information from heterogeneous and distributed data sources. The data integration system proposed in this book links remote databases in a networked environment, supports heterogeneous databases and data formats, links databases hosted on multiple platforms, and provides data security for database owners by allowing them to keep and maintain their own data and to choose information to be shared and linked. The book is a useful guide for researchers, practitioners, and graduate-level students interested in learning state-of-the-art development for data integration in biodiversity.

Data-Intensive Computing

by Ian Gorton Deborah K. Gracio

The world is awash with digital data from social networks, blogs, business, science and engineering. Data-intensive computing facilitates understanding of complex problems that must process massive amounts of data. Through the development of new classes of software, algorithms and hardware, data-intensive applications can provide timely and meaningful analytical results in response to exponentially growing data complexity and associated analysis requirements. This emerging area brings many challenges that are different from traditional high-performance computing. This reference for computing professionals and researchers describes the dimensions of the field, the key challenges, the state of the art and the characteristics of likely approaches that future data-intensive problems will require. Chapters cover general principles and methods for designing such systems and for managing and analyzing the big data sets of today that live in the cloud and describe example applications in bioinformatics and cybersecurity that illustrate these principles in practice.

Data Intelligence and Cognitive Informatics: Proceedings of ICDICI 2020 (Algorithms for Intelligent Systems)

by Selwyn Piramuthu I. Jeena Jacob Selvanayaki Kolandapalayam Shanmugam Przemyslaw Falkowski-Gilski

This book discusses new cognitive informatics tools, algorithms and methods that mimic the mechanisms of the human brain which lead to an impending revolution in understating a large amount of data generated by various smart applications. The book is a collection of peer-reviewed best selected research papers presented at the International Conference on Data Intelligence and Cognitive Informatics (ICDICI 2020), organized by SCAD College of Engineering and Technology, Tirunelveli, India, during 8–9 July 2020. The book includes novel work in data intelligence domain which combines with the increasing efforts of artificial intelligence, machine learning, deep learning and cognitive science to study and develop a deeper understanding of the information processing systems.

Data Integration Life Cycle Management with SSIS: A Short Introduction By Example

by Andy Leonard

Build a custom BimlExpress framework that generates dozens of SQL Server Integration Services (SSIS) packages in minutes. Use this framework to execute related SSIS packages in a single command. You will learn to configure SSIS catalog projects, manage catalog deployments, and monitor SSIS catalog execution and history. Data Integration Life Cycle Management with SSIS shows you how to bring DevOps benefits to SSIS integration projects. Practices in this book enable faster time to market, higher quality of code, and repeatable automation. Code will be created that is easier to support and maintain. The book teaches you how to more effectively manage SSIS in the enterprise environment by drawing on the art and science of modern DevOps practices. What You'll Learn Generate dozens of SSIS packages in minutes to speed your integration projects Reduce the execution of related groups of SSIS packages to a single command Successfully handle SSIS catalog deployments and their projects Monitor the execution and history of SSIS catalog projects Manage your enterprise data integration life cycle through automated tools and utilities Who This Book Is For Database professionals working with SQL Server Integration Services in enterprise environments. The book is especially useful to those readers following, or wishing to follow, DevOps practices in their use of SSIS.

Data Integration in the Life Sciences: 11th International Conference, DILS 2015, Los Angeles, CA, USA, July 9-10, 2015, Proceedings (Lecture Notes in Computer Science #9162)

by Naveen Ashish Jose-Luis Ambite

This book constitutes the proceedings of the 11th International Conference on Data Integration in the Life Sciences, DILS 2015, held in Los Angeles, CA, USA, in July 2015. The 24 papers presented in this volume were carefully reviewed and selected from 40 submissions. They are organized in topical sections named: data integration technologies; ontology and knowledge engineering for data integration; biomedical data standards and coding; medical research applications; and graduate student consortium.

Data Integration in the Life Sciences: 12th International Conference, DILS 2017, Luxembourg, Luxembourg, November 14-15, 2017, Proceedings (Lecture Notes in Computer Science #10649)

by Marcos Da Silveira Cédric Pruski Reinhard Schneider

This book constitutes the proceedings of the 12th International Conference on Data Integration in the Life Sciences, DILS 2017, held in Luxembourg, in November 2017. The 5 full papers and 5 short papers presented in this volume were carefully reviewed and selected from 16 submissions. They cover topics such as: life science data modelling; analysing, indexing, and querying life sciences datasets; annotating, matching, and sharing life sciences datasets; privacy and provenance of life sciences datasets.

Data Integration in the Life Sciences: 13th International Conference, Dils 2018, Hannover, Germany, November 20-21, 2018, Proceedings (Lecture Notes in Computer Science #11371)

by Maria-Esther Vidal Sören Auer

This book constitutes revised selected papers from the 13th International Conference on Data Integration in the Life Sciences, DILS 2018, held in Hannover, Germany, in November 2018. The 5 full, 8 short, 3 poster and 4 demo papers presented in this volume were carefully reviewed and selected from 22 submissions. The papers are organized in topical sections named: big biomedical data integration and management; data exploration in the life sciences; biomedical data analytics; and big biomedical applications.

Data Infrastructure Management: Insights and Strategies

by Greg Schulz

This book looks at various application and data demand drivers, along with data infrastructure options from legacy on premise, public cloud, hybrid, software-defined data center (SDDC), software data infrastructure (SDI), container as well as serverless along with infrastructure as a Service (IaaS), IT as a Service (ITaaS) along with related technology, trends, tools, techniques and strategies. Filled with example scenarios, tips and strategy considerations, the book covers frequently asked questions and answers to aid strategy as well as decision-making.

Data for the Public Good: How Data Can Help Citizens and Government

by Alex Howard

As we move into an era of unprecedented volumes of data and computing power, the benefits aren't for business alone. Data can help citizens access government, hold it accountable and build new services to help themselves. Simply making data available is not sufficient. The use of data for the public good is being driven by a distributed community of media, nonprofits, academics and civic advocates.This report from O'Reilly Radar highlights the principles of data in the public good, and surveys areas where data is already being used to great effect, covering: Consumer finance; Transit data; Government transparency; Data journalism; Aid and development; Crisis and emergency response; and Healthcare.

Data for the People: How to Make Our Post-Privacy Economy Work for You

by Andreas Weigend

A long-time chief data scientist at Amazon shows how open data can make everyone, not just corporations, richerEvery time we Google something, Facebook someone, Uber somewhere, or even just turn on a light, we create data that businesses collect and use to make decisions about us. In many ways this has improved our lives, yet, we as individuals do not benefit from this wealth of data as much as we could. Moreover, whether it is a bank evaluating our credit worthiness, an insurance company determining our risk level, or a potential employer deciding whether we get a job, it is likely that this data will be used against us rather than for us.In Data for the People, Andreas Weigend draws on his years as a consultant for commerce, education, healthcare, travel and finance companies to outline how Big Data can work better for all of us. As of today, how much we benefit from Big Data depends on how closely the interests of big companies align with our own. Too often, outdated standards of control and privacy force us into unfair contracts with data companies, but it doesn't have to be this way. Weigend makes a powerful argument that we need to take control of how our data is used to actually make it work for us. Only then can we the people get back more from Big Data than we give it.Big Data is here to stay. Now is the time to find out how we can be empowered by it.

Data Fluency

by Richard Galentino Zach Gemignani Patrick Schuermann Chris Gemignani

A dream come true for those looking to improve their data fluencyAnalytical data is a powerful tool for growing companies, but what good is it if it hides in the shadows? Bring your data to the forefront with effective visualization and communication approaches, and let Data Fluency: Empowering Your Organization with Effective Communication show you the best tools and strategies for getting the job done right. Learn the best practices of data presentation and the ways that reporting and dashboards can help organizations effectively gauge performance, identify areas for improvement, and communicate results.Topics covered in the book include data reporting and communication, audience and user needs, data presentation tools, layout and styling, and common design failures. Those responsible for analytics, reporting, or BI implementation will find a refreshing take on data and visualization in this resource, as will report, data visualization, and dashboard designers.Conquer the challenge of making valuable data approachable and easy to understandDevelop unique skills required to shape data to the needs of different audiencesFull color book links to bonus content at juiceanalytics.comWritten by well-known and highly esteemed authors in the data presentation communityData Fluency: Empowering Your Organization with Effective Communication focuses on user experience, making reports approachable, and presenting data in a compelling, inspiring way. The book helps to dissolve the disconnect between your data and those who might use it and can help make an impact on the people who are most affected by data. Use Data Fluency today to develop the skills necessary to turn data into effective displays for decision-making.

Refine Search

Showing 28,976 through 29,000 of 37,951 results