IBM Spectrum Scale: Big Data and Analytics Solution Brief

Title IBM Spectrum Scale: Big Data and Analytics Solution Brief
Author Wei G. Gong
Publisher IBM Redbooks
Total Pages 14
Release 2018-01-23
Genre Computers
ISBN 0738456632

This IBM® Redguide™ publication describes big data and analytics deployments that are built on IBM Spectrum Scale™. IBM Spectrum Scale is a proven enterprise-level distributed file system that is a high-performance and cost-effective alternative to Hadoop Distributed File System (HDFS) for Hadoop analytics services. IBM Spectrum Scale includes NFS, SMB, and Object services and meets the performance requirements of many industry workloads, such as technical computing, big data, analytics, and content management. IBM Spectrum Scale provides world-class, web-based storage management with extreme scalability, flash-accelerated performance, and automatic policy-based storage tiering from flash through disk to the cloud, which can reduce storage costs by up to 90% while improving security and management efficiency in cloud, big data, and analytics environments. This Redguide publication is intended for technical professionals (analytics consultants, technical support staff, IT architects, and IT specialists) who are responsible for providing Hadoop analytics services and are interested in learning about the benefits of using IBM Spectrum Scale as an alternative to HDFS.

IBM Spectrum Scale

Title IBM Spectrum Scale
Author Wei G. Gong
Publisher
Total Pages
Release 2019
Genre Big data
ISBN

Enabling Hybrid Cloud Storage for IBM Spectrum Scale Using Transparent Cloud Tiering

Title Enabling Hybrid Cloud Storage for IBM Spectrum Scale Using Transparent Cloud Tiering
Author Nikhil Khandelwal
Publisher IBM Redbooks
Total Pages 44
Release 2018-05-31
Genre Computers
ISBN 0738456861

This IBM® Redbooks® publication provides information to help you with the sizing, configuration, and monitoring of hybrid cloud solutions using the transparent cloud tiering (TCT) functionality of IBM Spectrum™ Scale. IBM Spectrum Scale™ is a scalable data, file, and object management solution that provides a global namespace for large data sets and several enterprise features. The IBM Spectrum Scale feature called transparent cloud tiering allows cloud object storage providers, such as IBM Cloud™ Object Storage, IBM Cloud, and Amazon S3, to be used as a storage tier for IBM Spectrum Scale. Transparent cloud tiering can help cut storage capital and operating costs by moving data that does not require local performance to an on-premises or off-premises cloud object storage provider. Transparent cloud tiering reduces the complexity of cloud object storage by making data transfers transparent to the user or application. This capability can help you adapt to a hybrid cloud deployment model where active data remains directly accessible to your applications and inactive data is placed in the correct cloud (private or public) automatically through IBM Spectrum Scale policies. This publication is intended for IT architects, IT administrators, storage administrators, and those wanting to learn more about sizing, configuration, and monitoring of hybrid cloud solutions using IBM Spectrum Scale and transparent cloud tiering.

Hortonworks Data Platform with IBM Spectrum Scale: Reference Guide for Building an Integrated Solution

Title Hortonworks Data Platform with IBM Spectrum Scale: Reference Guide for Building an Integrated Solution
Author Sandeep R. Patil
Publisher IBM Redbooks
Total Pages 30
Release 2018-06-26
Genre Computers
ISBN 0738456969

This IBM® Redpaper™ publication provides guidance on building an enterprise-grade data lake by using IBM Spectrum™ Scale and Hortonworks Data Platform for performing in-place Hadoop or Spark-based analytics. It covers the benefits of the integrated solution and gives guidance about the types of deployment models and the considerations that apply when implementing them. Hortonworks Data Platform (HDP) is a leading Hadoop and Spark distribution. HDP addresses the complete needs of data-at-rest, powers real-time customer applications, and delivers robust analytics that accelerate decision making and innovation. IBM Spectrum Scale™ is flexible and scalable software-defined file storage for analytics workloads. Enterprises around the globe have deployed IBM Spectrum Scale to form large data lakes and content repositories to perform high-performance computing (HPC) and analytics workloads. It can scale both performance and capacity without bottlenecks.

Integration of IBM Aspera Sync with IBM Spectrum Scale: Protecting and Sharing Files Globally

Title Integration of IBM Aspera Sync with IBM Spectrum Scale: Protecting and Sharing Files Globally
Author Nils Haustein
Publisher IBM Redbooks
Total Pages 78
Release 2019-03-29
Genre Computers
ISBN 0738457493

Economic globalization requires data to be available globally. With most data stored in file systems, solutions to make this data globally available become more important. Files that are in file systems can be protected or shared by replicating these files to another file system that is in a remote location. The remote location might be just around the corner or in a different country. Therefore, the techniques that are used to protect and share files must account for long distances and slow and unreliable wide area network (WAN) connections. IBM® Spectrum Scale is a scalable clustered file system that can be used to store all kinds of unstructured data. It provides open data access by way of Network File System (NFS); Server Message Block (SMB); POSIX; Object Storage APIs, such as S3 and OpenStack Swift; and the Hadoop Distributed File System (HDFS) for accessing and sharing data. The IBM Aspera® file transfer solution (IBM Aspera Sync) provides predictable and reliable data transfer across large distances for small and large files. The combination of both can be used for global sharing and protection of data. This IBM Redpaper™ publication describes how IBM Aspera Sync can be used to protect and share data that is stored in IBM Spectrum™ Scale file systems across large distances of several hundred to thousands of miles. We also explain the integration of IBM Aspera Sync with IBM Spectrum Scale™ and differentiate it from solutions that are built into IBM Spectrum Scale for protection and sharing. We also describe different use cases for IBM Aspera Sync with IBM Spectrum Scale.

IBM Software Defined Infrastructure for Big Data Analytics Workloads

Title IBM Software Defined Infrastructure for Big Data Analytics Workloads
Author Dino Quintero
Publisher IBM Redbooks
Total Pages 180
Release 2015-06-29
Genre Computers
ISBN 0738440779

This IBM® Redbooks® publication documents how IBM Platform Computing, with its IBM Platform Symphony® MapReduce framework, IBM Spectrum Scale (based upon IBM GPFS™), IBM Platform LSF®, and the Advanced Service Controller for Platform Symphony work together as an infrastructure to manage not just Hadoop-related offerings, but many popular industry offerings, such as Apache Spark, Storm, MongoDB, Cassandra, and so on. It describes the different ways to run Hadoop in a big data environment, and demonstrates how IBM Platform Computing solutions, such as Platform Symphony and Platform LSF with its MapReduce Accelerator, can help improve performance and agility when running Hadoop on distributed workload managers offered by IBM. This information is for technical professionals (consultants, technical support staff, IT architects, and IT specialists) who are responsible for delivering cost-effective cloud services and big data solutions on IBM Power Systems™ to help uncover insights in clients' data so they can optimize product development and business results.

Making Data Smarter with IBM Spectrum Discover: Practical AI Solutions

Title Making Data Smarter with IBM Spectrum Discover: Practical AI Solutions
Author Ivaylo B. Bozhinov
Publisher IBM Redbooks
Total Pages 170
Release 2020-10-19
Genre Computers
ISBN 0738459135

More than 80% of all data that is collected by organizations is not in a standard relational database. Instead, it is trapped in unstructured documents, social media posts, machine logs, and so on. Many organizations face significant challenges in managing this deluge of unstructured data, such as pinpointing and activating relevant data for large-scale analytics; lacking the fine-grained visibility that is needed to map data to business priorities; removing redundant, obsolete, and trivial (ROT) data; and identifying and classifying sensitive data. IBM® Spectrum Discover is modern metadata management software that provides data insight for petabyte-scale file and object storage, both on-premises and in the cloud. This software enables organizations to make better business decisions and gain and maintain a competitive advantage. IBM Spectrum® Discover provides a rich metadata layer that enables storage administrators, data stewards, and data scientists to efficiently manage, classify, and gain insights from massive amounts of unstructured data. It improves storage economics, helps mitigate risk, and accelerates large-scale analytics to create competitive advantage and speed critical research. This IBM Redbooks® publication presents several use cases that are focused on artificial intelligence (AI) solutions with IBM Spectrum Discover. This book helps storage administrators and technical specialists plan and implement AI solutions by using IBM Spectrum Discover and several other IBM Storage products.