- Table View
- List View
Data Science: New Issues, Challenges and Applications (Studies in Computational Intelligence #869)
by Janusz Kacprzyk Gintautas Dzemyda Jolita BernatavičienėThis book contains 16 chapters by researchers working in various fields of data science. They focus on theory and applications in language technologies, optimization, computational thinking, intelligent decision support systems, decomposition of signals, model-driven development methodologies, interoperability of enterprise applications, anomaly detection in financial markets, 3D virtual reality, monitoring of environmental data, convolutional neural networks, knowledge storage, data stream classification, and security in social networking. The respective papers highlight a wealth of issues in, and applications of, data science. Modern technologies allow us to store and transfer large amounts of data quickly. They can be very diverse - images, numbers, streaming, related to human behavior and physiological parameters, etc. Whether the data is just raw numbers, crude images, or will help solve current problems and predict future developments, depends on whether we can effectively process and analyze it. Data science is evolving rapidly. However, it is still a very young field. In particular, data science is concerned with visualizations, statistics, pattern recognition, neurocomputing, image analysis, machine learning, artificial intelligence, databases and data processing, data mining, big data analytics, and knowledge discovery in databases. It also has many interfaces with optimization, block chaining, cyber-social and cyber-physical systems, Internet of Things (IoT), social computing, high-performance computing, in-memory key-value stores, cloud computing, social computing, data feeds, overlay networks, cognitive computing, crowdsource analysis, log analysis, container-based virtualization, and lifetime value modeling. Again, all of these areas are highly interrelated. In addition, data science is now expanding to new fields of application: chemical engineering, biotechnology, building energy management, materials microscopy, geographic research, learning analytics, radiology, metal design, ecosystem homeostasis investigation, and many others.
Data Science: Second International Conference, ICDS 2015, Sydney, Australia, August 8-9, 2015, Proceedings (Lecture Notes in Computer Science #9208)
by Yong Shi Yingjie Tian Philip S. Yu Peng Zhang Chengqi Zhang Wei Huang Yangyong Zhu Jing HeThis book constitutes the refereed proceedings of theSecond International Conference on Data Science, ICDS 2015, held in Sydney,Australia, during August 8-9, 2015. The 19 revised full papers and 5 short papers presentedwere carefully reviewed and selected from 31 submissions. The papers focus onthe following topics: mathematical issues in data science; big data issues andapplications; data quality and data preparation; data-driven scientificresearch; evaluation and measurement in data service; big data mining andknowledge management; case study of data science; social impacts of datascience.
Data Science: Techniques and Intelligent Applications
by Idongesit Williams Ramchandra Mangrulkar Parikshit N Mahalle Pallavi ChavanThis book covers the topic of data science in a comprehensive manner and synthesizes both fundamental and advanced topics of a research area that has now reached its maturity. The book starts with the basic concepts of data science. It highlights the types of data and their use and importance, followed by a discussion on a wide range of applications of data science and widely used techniques in data science. Key Features • Provides an internationally respected collection of scientific research methods, technologies and applications in the area of data science. • Presents predictive outcomes by applying data science techniques to real-life applications. • Provides readers with the tools, techniques and cases required to excel with modern artificial intelligence methods. • Gives the reader a variety of intelligent applications that can be designed using data science and its allied fields. The book is aimed primarily at advanced undergraduates and graduates studying machine learning and data science. Researchers and professionals will also find this book useful.
Data Science: Techniques for Excelling at Data Science
by Daniel VaughanThis practical guide provides a collection of techniques and best practices that are generally overlooked in most data engineering and data science pedagogy. A common misconception is that great data scientists are experts in the "big themes" of the discipline—machine learning and programming. But most of the time, these tools can only take us so far. In practice, the smaller tools and skills really separate a great data scientist from a not-so-great one.Taken as a whole, the lessons in this book make the difference between an average data scientist candidate and a qualified data scientist working in the field. Author Daniel Vaughan has collected, extended, and used these skills to create value and train data scientists from different companies and industries.With this book, you will:Understand how data science creates valueDeliver compelling narratives to sell your data science projectBuild a business case using unit economics principlesCreate new features for a ML model using storytellingLearn how to decompose KPIsPerform growth decompositions to find root causes for changes in a metricDaniel Vaughan is head of data at Clip, the leading paytech company in Mexico. He's the author of Analytical Skills for AI and Data Science (O'Reilly).
Data Science: Theory, Algorithms, and Applications (Transactions on Computer Systems and Networks)
by Badal Soni Gyanendra K. Verma Salah Bourennane Alexandre C. B. RamosThis book targets an audience with a basic understanding of deep learning, its architectures, and its application in the multimedia domain. Background in machine learning is helpful in exploring various aspects of deep learning. Deep learning models have a major impact on multimedia research and raised the performance bar substantially in many of the standard evaluations. Moreover, new multi-modal challenges are tackled, which older systems would not have been able to handle. However, it is very difficult to comprehend, let alone guide, the process of learning in deep neural networks, there is an air of uncertainty about exactly what and how these networks learn. By the end of the book, the readers will have an understanding of different deep learning approaches, models, pre-trained models, and familiarity with the implementation of various deep learning algorithms using various frameworks and libraries.
Data Science: Theory, Analysis and Applications
by Qurban A. Memon Shakeel Ahmed KhojaThe aim of this book is to provide an internationally respected collection of scientific research methods, technologies and applications in the area of data science. This book can prove useful to the researchers, professors, research students and practitioners as it reports novel research work on challenging topics in the area surrounding data science. In this book, some of the chapters are written in tutorial style concerning machine learning algorithms, data analysis, information design, infographics, relevant applications, etc. The book is structured as follows: • Part I: Data Science: Theory, Concepts, and Algorithms This part comprises five chapters on data Science theory, concepts, techniques and algorithms. • Part II: Data Design and Analysis This part comprises five chapters on data design and analysis. • Part III: Applications and New Trends in Data Science This part comprises four chapters on applications and new trends in data science.
Data Science: Third International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2017, Changsha, China, September 22–24, 2017, Proceedings, Part I (Communications in Computer and Information Science #727)
by Min Li Hongzhi Wang Xianhua Song Zeguang Lu Beiji Zou Wei XieWith the ever-growing power to generate, transmit and collect huge amounts of data, information overload is now an imminent problem to mankind. The overwhelming demand for information processing is not just about a better - derstanding of data, but also a better usage of data in a timely fashion. Data mining, or knowledge discovery from databases, is proposed to gain insight into aspects of dataand to help peoplemakeinformed, sensible, andbetter decisions. At present, growing attention has been paid to the study, development and - plication of data mining. As a result there is an urgent need for sophisticated techniques and tools that can handle new ?elds of data mining, e. g. , spatialdata mining, biomedical data mining, and mining on high-speed and time-variant data streams. The knowledge of data mining should also be expanded to new applications. The1stInternationalConferenceonAdvancedDataMiningandApplications (ADMA 2005) aimed to bring together the experts on data mining throughout the world. It provided a leading international forum for the dissemination of original research results in advanced data mining techniques, applications, al- rithms, software and systems, and di'erent applied disciplines. The conference attracted 539 online submissions and 63 mailing submissions from 25 di'erent countriesandareas. Allfullpaperswerepeer reviewedbyatleastthreemembers of the Program Committee composed of international experts in data mining ?elds. A total number of 100 papers were accepted for the conference. Amongst them 25 papers were selected as regular papers and 75 papers were selected as short papers, yielding a combined acceptance rate of 17%.
Data Science: Third International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2017, Changsha, China, September 22–24, 2017, Proceedings, Part II (Communications in Computer and Information Science #728)
by Qilong Han Weipeng Jing Guanglu Sun Zeguang Lu Beiji Zou Xiaoning PengThis bookconstitutes the refereed proceedings of the First National Conference on BigData Technology and Applications, BDTA 2015, held in Harbin, China, in December2015. The 26revised papers presented were carefully reviewed and selected from numeroussubmissions. The papers address issues such as the storage technology of Big Data;analysis of Big Data and data mining; visualization of Big Data; the parallelcomputing framework under Big Data; the architecture and basic theory of BigData; collection and preprocessing of Big Data; innovative applications in someareas, such as internet of things and cloud computing.
Data Science—Analytics and Applications: Proceedings of the 5th International Data Science Conference—iDSC2023
by Peter Haber Manfred Mayr Thomas J. LampoltshammerBased on the overall digitalization in all spheres of our lives, Data Science and Artificial Intelligence (AI) are nowadays cornerstones for innovation, problem solutions, and business transformation. Data, whether structured or unstructured, numerical, textual, or audiovisual, put in context with other data or analyzed and processed by smart algorithms, are the basis for intelligent concepts and practical solutions. These solutions address many application areas such as Industry 4.0, the Internet of Things (IoT), smart cities, smart energy generation, and distribution, and environmental management. Innovation dynamics and business opportunities for effective solutions for the essential societal, environmental, or health challenges, are enabled and driven by modern data science approaches.However, Data Science and Artificial Intelligence are forming a new field that needs attention and focused research. Effective data science is only achieved in a broad and diverse discourse – when data science experts cooperate tightly with application domain experts and scientists exchange views and methods with engineers and business experts. Thus, the 5th International Data Science Conference (iDSC 2023) brings together researchers, scientists, business experts, and practitioners to discuss new approaches, methods, and tools made possible by data science.
Data Security Breaches and Privacy in Europe (SpringerBriefs in Cybersecurity)
by Rebecca WongData Security Breaches and Privacy in Europe aims to consider data protection and cybersecurity issues; more specifically, it aims to provide a fruitful discussion on data security breaches. A detailed analysis of the European Data Protection framework will be examined. In particular, the Data Protection Directive 95/45/EC, the Directive on Privacy and Electronic Communications and the proposed changes under the Data Protection Regulation (data breach notifications) and its implications are considered. This is followed by an examination of the Directive on Attacks against information systems and a discussion of the proposed Cybersecurity Directive, considering its shortcomings and its effects. The author concludes by looking at whether a balance can be drawn by the current and proposed Data Protection framework to protect against data security breaches and considers what more needs to be achieved.
Data Security and Privacy Protection: Second International Conference, DSPP 2024, Xi'an, China, October 25–28, 2024, Proceedings, Part I (Lecture Notes in Computer Science #15215)
by Moti Yung Xinyi Huang Xiaofeng ChenThis book constitutes the proceedings of the 2nd International Conference on Data Security and Privacy Protection, DSPP 2024, held in Xi'an, China, during October 25-28, 2024. The 34 full papers included in this volume were carefully reviewed and selected from a total of 99 submissions. The DSPP 2024 conference promotes and stimulates discussion on the latest theories, algorithms, applications, and emerging topics on data security and privacy protection. It encourages the cross-fertilization of ideas and provides a platform for researchers, professionals, and students worldwide to discuss and present their research results.
Data Security and Privacy Protection: Second International Conference, DSPP 2024, Xi'an, China, October 25–28, 2024, Proceedings, Part II (Lecture Notes in Computer Science #15216)
by Moti Yung Xinyi Huang Xiaofeng ChenThis book constitutes the proceedings of the 2nd International Conference on Data Security and Privacy Protection, DSPP 2024, held in Xi'an, China, during October 25-28, 2024. The 34 full papers included in this volume were carefully reviewed and selected from a total of 99 submissions. The DSPP 2024 conference promotes and stimulates discussion on the latest theories, algorithms, applications, and emerging topics on data security and privacy protection. It encourages the cross-fertilization of ideas and provides a platform for researchers, professionals, and students worldwide to discuss and present their research results.
Data Security and Privacy Protection: Third International Conference, DSPP 2025, Xi'an, China, October 16–18, 2025, Proceedings, Part II (Lecture Notes in Computer Science #16177)
by Ding Wang Xiaofeng Chen Haibo HuThis book constitutes the proceedings of the 3rd International Conference on Data Security and Privacy Protection, DSPP 2025, held in Xi'an, China, during October 16–18, 2025. The 36 full papers and 11 short papers presented in these two volumes were carefully reviewed and selected from 105 submissions. The papers are organized in the following topical sections: Part I:AI and System Security; Blockchain and Related Technologies; Privacy Preserving/Enhancing Technologies; Cryptographic Primitives; Privacy-Aware Federated Learning; AI-based Security Applications and Technologies. Part II: AI-based Security Applications and Technologies; Cryptographic Protocols Design and Analysis; Model Security and Copyright Protection.
Data Security in Cloud Storage (Wireless Networks)
by Xuemin Sherman Shen Yuan Zhang Chunxiang XuThis book provides a comprehensive overview of data security in cloud storage, ranging from basic paradigms and principles, to typical security issues and practical security solutions. It also illustrates how malicious attackers benefit from the compromised security of outsourced data in cloud storage and how attacks work in real situations, together with the countermeasures used to ensure the security of outsourced data. Furthermore, the book introduces a number of emerging technologies that hold considerable potential – for example, blockchain, trusted execution environment, and indistinguishability obfuscation – and outlines open issues and future research directions in cloud storage security. The topics addressed are important for the academic community, but are also crucial for industry, since cloud storage has become a fundamental component in many applications. The book offers a general introduction for interested readers with a basic modern cryptography background, and a reference guide for researchers and practitioners in the fields of data security and cloud storage. It will also help developers and engineers understand why some current systems are insecure and inefficient, and move them to design and develop improved systems.
Data Security in Internet of Things Based RFID and WSN Systems Applications (Internet of Everything (IoE))
by Rohit Sharma Korhan Cengiz Rajendra Prasad MahapatraThis book focuses on RFID (Radio Frequency Identification), IoT (Internet of Things), and WSN (Wireless Sensor Network). It includes contributions that discuss the security and privacy issues as well as the opportunities and applications that are tightly linked to sensitive infrastructures and strategic services. This book addresses the complete functional framework and workflow in IoT-enabled RFID systems and explores basic and high-level concepts. It is based on the latest technologies and covers the major challenges, issues, and advances in the field. It presents data acquisition and case studies related to data-intensive technologies in RFID-based IoT and includes WSN-based systems and their security. It can serve as a manual for those in the industry while also helping beginners to understand both the basic and advanced aspects of IoT-based RFID-related issues. This book can be a premier interdisciplinary platform for researchers, practitioners, and educators to present and discuss the most recent innovations, trends, and concerns as well as practical challenges encountered, and find solutions that have been adopted in the fields of IoT and analytics.
Data Security: Technical and Organizational Protection Measures against Data Loss and Computer Crime
by Thomas H. LenhardUsing many practical examples and notes, the book offers an easy-to-understand introduction to technical and organizational data security. It provides an insight into the technical knowledge that is mandatory for data protection officers. Data security is an inseparable part of data protection, which is becoming more and more important in our society. It can only be implemented effectively if there is an understanding of technical interrelationships and threats. Data security covers much more information than just personal data. It secures all data and thus the continued existence of companies and organizations.This book is a translation of the original German 2nd edition Datensicherheit by Thomas H. Lenhard, published by Springer Fachmedien Wiesbaden GmbH, part of Springer Nature in 2020. The translation was done with the help of artificial intelligence (machine translation by the service DeepL.com). A subsequent human revision was done primarily in terms of content, so that the book will read stylistically differently from a conventional translation. Springer Nature works continuously to further the development of tools for the production of books and on the related technologies to support the authors.
Data Sharing für KMU: Voraussetzungen und Instrumente für die gemeinsame Nutzung von Daten
by Jürg Meierhofer Petra Kugler Martin Dobler Marc Strittmatter Manuel Treiterer Helen VogtEin bewusster Umgang mit Daten ist für Unternehmen wichtiger denn je: Sie fördern Innovation in Geschäftsmodellen, erfordern aber auch einen effizienten und nachhaltigen Datenumgang. Unternehmen, die Daten teilen und nutzen, wirtschaften effizienter durch Data Sharing. In diesem Fachbuch werden verschiedene Aspekte des Data Sharings aus unternehmensübergreifender und -interner Perspektive, vorwiegend mit Blick auf kleine und mittelständische Unternehmen (KMU), betrachtet. Die Autor:innen untersuchen, wie Unternehmen Anreize schaffen können, um erfolgreich an der Datenökonomie teilzunehmen, aber auch welche externen Bedingungen gegeben sein sollten, um Unternehmen in die Lage zu versetzen, ihre ökonomischen Potenziale in Bezug auf Daten zu maximieren.Das Buch zeigt, wie KMU den Wert ihrer Daten optimieren, vertrauensvolle Partnerschaften aufbauen, Sicherheitsbarrieren überwinden, eine datenzentrierte Kultur fördern und rechtliche Fragen auf praktische und effektive Weise regeln können. Die Autor:innen stellen umsetzbare Erkenntnisse und Beispiele aus der Praxis vor und geben KMUs die Werkzeuge an die Hand, um in einem datenzentrierten Geschäftsumfeld erfolgreich zu sein. Der Inhalt Das Data Sharing Framework: Vertrauen, Organisationskultur, Wert von Daten, Sicherheit und Recht & Governance Strategische Dimensionen, Datenbewertung in der Praxis, ESG-Berichterstattung, Monetarisierung, LLM und Datenökosysteme Fallstudien
Data Sketches: A journey of imagination, exploration, and beautiful data visualizations (AK Peters Visualization Series)
by Nadieh Bremer Shirley WuIn Data Sketches, Nadieh Bremer and Shirley Wu document the deeply creative process behind 24 unique data visualization projects, and they combine this with powerful technical insights which reveal the mindset behind coding creatively. Exploring 12 different themes – from the Olympics to Presidents & Royals and from Movies to Myths & Legends – each pair of visualizations explores different technologies and forms, blurring the boundary between visualization as an exploratory tool and an artform in its own right. This beautiful book provides an intimate, behind-the-scenes account of all 24 projects and shares the authors’ personal notes and drafts every step of the way. The book features: Detailed information on data gathering, sketching, and coding data visualizations for the web, with screenshots of works-in-progress and reproductions from the authors’ notebooks Never-before-published technical write-ups, with beginner-friendly explanations of core data visualization concepts Practical lessons based on the data and design challenges overcome during each project Full-color pages, showcasing all 24 final data visualizations This book is perfect for anyone interested or working in data visualization and information design, and especially those who want to take their work to the next level and are inspired by unique and compelling data-driven storytelling.
Data Smart
by John W. ForemanData Science gets thrown around in the press like it's magic. Major retailers are predicting everything from when their customers are pregnant to when they want a new pair of Chuck Taylors. It's a brave new world where seemingly meaningless data can be transformed into valuable insight to drive smart business decisions.But how does one exactly do data science? Do you have to hire one of these priests of the dark arts, the "data scientist," to extract this gold from your data? Nope.Data science is little more than using straight-forward steps to process raw data into actionable insight. And in Data Smart, author and data scientist John Foreman will show you how that's done within the familiar environment of a spreadsheet. Why a spreadsheet? It's comfortable! You get to look at the data every step of the way, building confidence as you learn the tricks of the trade. Plus, spreadsheets are a vendor-neutral place to learn data science without the hype. But don't let the Excel sheets fool you. This is a book for those serious about learning the analytic techniques, the math and the magic, behind big data. Each chapter will cover a different technique in a spreadsheet so you can follow along:Mathematical optimization, including non-linear programming and genetic algorithmsClustering via k-means, spherical k-means, and graph modularityData mining in graphs, such as outlier detectionSupervised AI through logistic regression, ensemble models, and bag-of-words modelsForecasting, seasonal adjustments, and prediction intervals through monte carlo simulationMoving from spreadsheets into the R programming languageYou get your hands dirty as you work alongside John through each technique. But never fear, the topics are readily applicable and the author laces humor throughout. You'll even learn what a dead squirrel has to do with optimization modeling, which you no doubt are dying to know.
Data Smart: Using Data Science to Transform Information into Insight
by Jordan GoldmeierA straightforward and engaging approach to data science that skips the jargon and focuses on the essentials In the newly revised second edition of Data Smart: Using Data Science to Transform Information into Insight, accomplished data scientist and speaker Jordan Goldmeier delivers an approachable and conversational approach to data science using Microsoft Excel’s easily understood features. The author also walks readers through the fundamentals of statistics, machine learning and powerful artificial intelligence concepts, focusing on how to learn by doing. You’ll also find: Four-color data visualizations that highlight and illustrate the concepts discussed in the book Tutorials explaining complicated data science using just Microsoft Excel How to take what you’ve learned and apply it to everyday problems at work and lifeA must-read guide to data science for every day, non-technical professionals, Data Smart will earn a place on the bookshelves of students, analysts, data-driven managers, marketers, consultants, business intelligence analysts, demand forecasters, and revenue managers.
Data Source Handbook: A Guide to Public Data
by Pete WardenIf you're a developer looking to supplement your own data tools and services, this concise ebook covers the most useful sources of public data available today. You'll find useful information on APIs that offer broad coverage, tie their data to the outside world, and are either accessible online or feature downloadable bulk data. You'll also find code and helpful links.This guide organizes APIs by the subjects they cover—such as websites, people, or places—so you can quickly locate the best resources for augmenting the data you handle in your own service. Categories include:Website tools such as WHOIS, bit.ly, and CompeteServices that use email addresses as search terms, including GithubFinding information from just a name, with APIs such as WhitePagesServices, such as Klout, for locating people with Facebook and Twitter accountsSearch APIs, including BOSS and WikipediaGeographical data sources, including SimpleGeo and U.S. CensusCompany information APIs, such as CrunchBase and ZoomInfoAPIs that list IP addresses, such as MaxMindServices that list books, films, music, and products
Data Spaces: Design, Deployment and Future Directions
by Edward Curry Simon Scerri Tuomo TuikkaThis open access book aims to educate data space designers to understand what is required to create a successful data space. It explores cutting-edge theory, technologies, methodologies, and best practices for data spaces for both industrial and personal data and provides the reader with a basis for understanding the design, deployment, and future directions of data spaces.The book captures the early lessons and experience in creating data spaces. It arranges these contributions into three parts covering design, deployment, and future directions respectively. The first part explores the design space of data spaces. The single chapters detail the organisational design for data spaces, data platforms, data governance federated learning, personal data sharing, data marketplaces, and hybrid artificial intelligence for data spaces.The second part describes the use of data spaces within real-world deployments. Its chapters are co-authored with industry experts and include case studies of data spaces in sectors including industry 4.0, food safety, FinTech, health care, and energy.The third and final part details future directions for data spaces, including challenges and opportunities for common European data spaces and privacy-preserving techniques for trustworthy data sharing.The book is of interest to two primary audiences: first, researchers interested in data management and data sharing, and second, practitioners and industry experts engaged in data-driven systems where the sharing and exchange of data within an ecosystem are critical.
Data Stewardship for Open Science: Implementing FAIR Principles
by Barend MonsData Stewardship for Open Science: Implementing FAIR Principles has been written with the intention of making scientists, funders, and innovators in all disciplines and stages of their professional activities broadly aware of the need, complexity, and challenges associated with open science, modern science communication, and data stewardship. The FAIR principles are used as a guide throughout the text, and this book should leave experimentalists consciously incompetent about data stewardship and motivated to respect data stewards as representatives of a new profession, while possibly motivating others to consider a career in the field. The ebook, avalable for no additional cost when you buy the paperback, will be updated every 6 months on average (providing that significant updates are needed or avaialble). Readers will have the opportunity to contribute material towards these updates, and to develop their own data management plans, via the free Data Stewardship Wizard.
Data Storage Architectures and Technologies
by Jiwu ShuData is a core asset in the current development of information technology and needs to be stored efficiently and reliably to serve many important real-world applications such as the Internet, big data, artificial intelligence, and high-performance computing. Generations of researchers and practitioners have continued to innovate the design of storage systems to achieve the goals of high performance, ease of use, and high reliability. This textbook provides a thorough and comprehensive introduction to the field of data storage. With 14 chapters, the book not only covers the basics of storage devices, storage arrays, storage protocols, key-value stores, file systems, network storage architecture, distributed storage systems, storage reliability, storage security, and data protection, but also provides in-depth discussions on advanced topics such as storage maintenance, storage solutions, and storage technology trends and developments (e.g., in-storage computing, persistent memory system, blockchain storage, and in-network storage system). For each section, the authors have attempted to provide the latest current academic and industry research progress that will help readers deepen their understanding and application of basic data storage concepts. This textbook is ideal for storage courses targeting upper-level undergraduate or graduate students in computer science and related disciplines. It also serves as a valuable reference for technical professionals.
Data Storage for Social Networks: A Socially Aware Approach (SpringerBriefs in Optimization)
by Duc A. TranEvidenced by the success of Facebook, Twitter, and LinkedIn, online social networks (OSNs) have become ubiquitous, offering novel ways for people to access information and communicate with each other. As the increasing popularity of social networking is undeniable, scalability is an important issue for any OSN that wants to serve a large number of users. Storing user data for the entire network on a single server can quickly lead to a bottleneck, and, consequently, more servers are needed to expand storage capacity and lower data request traffic per server. Adding more servers is just one step to address scalability. The next step is to determine how best to store the data across multiple servers. This problem has been widely-studied in the literature of distributed and database systems. OSNs, however, represent a different class of data systems. When a user spends time on a social network, the data mostly requested is her own and that of her friends; e.g., in Facebook or Twitter, these data are the status updates posted by herself as well as that posted by the friends. This so-called social locality should be taken into account when determining the server locations to store these data, so that when a user issues a read request, all its relevant data can be returned quickly and efficiently. Social locality is not a design factor in traditional storage systems where data requests are always processed independently. Even for today's OSNs, social locality is not yet considered in their data partition schemes. These schemes rely on distributed hash tables (DHT), using consistent hashing to assign the users' data to the servers. The random nature of DHT leads to weak social locality which has been shown to result in poor performance under heavy request loads. Data Storage for Social Networks: A Socially Aware Approach is aimed at reviewing the current literature of data storage for online social networks and discussing new methods that take into account social awareness in designing efficient data storage.