Monday, April 30, 2018

Build State-of-the-Art AI Deep Learning Systems

Newer AI and machine learning applications require high-performance computing (HPC) and massive storage. HPE offers a full suite of HPC servers.



The HPE Apollo 6500 Gen10 System is an HPC and deep learning platform built around industry-leading GPUs, a fast GPU interconnect, a high-bandwidth fabric, and a configurable GPU topology to match your workloads. The ability of computers to autonomously learn, predict, and adapt using massive data sets is driving innovation and competitive advantage across many industries and applications, and it is this trend that drives these requirements.

The system offers rock-solid reliability, availability, and serviceability (RAS) features. It includes up to eight GPUs per server, NVLink 2.0 for fast (up to 300 GB/s) GPU-to-GPU communication, support for Intel® Xeon® Scalable processors, a choice of high-speed, low-latency fabrics, and flexible configuration options to tune it for the workload. The system is aimed at deep learning workloads: AI models that would take days or weeks to train can now be trained in a few hours or minutes.

HPE SDS Storage Solutions

HPE Solution for Intel Enterprise Edition for Lustre is a high-performance computing (HPC) storage solution that includes the HPE Apollo 4520 System and Intel® Enterprise Edition for Lustre. The Apollo 4520 is a dual-node system with up to 46 drives. Capacity can be increased by adding drives in a disk enclosure. Adding more systems in parallel scales performance and capacity nearly linearly.

Scality RING running on HPE ProLiant servers provides a software-defined storage (SDS) solution for petabyte-scale data storage that is designed to interoperate in the modern software-defined data center (SDDC). The RING software is designed to create a scale-out storage system, deployed as a distributed system on a minimum cluster of six storage servers.

Scality RING object storage software solutions are designed to run on industry-standard server platforms, offering lower infrastructure costs and scalability beyond the capacity points of typical file server storage subsystems. The HPE Apollo 4200 series servers provide a comprehensive and cost-effective set of storage building blocks for customers who wish to deploy an object storage software solution.

Vertica v9

Vertica, formerly an HPE product, delivers high-performance in-database machine learning and advanced analytics in Version 9. This unified advanced analytics database features advancements in in-database machine learning.


Friday, April 13, 2018

Active Data Management for Big Data Analytics


Better Way to Manage Big Data with Containers

Big data analytics using Hadoop and MapReduce is best done in containers rather than on bare-metal or dedicated servers. Running big data analytics in containers using Kubernetes is faster and costs less.

Storing data in software-defined storage systems (using HDFS over an object store) provides shared storage and allows in-place analytics. This is more efficient than copying all the files over to local disks, so it saves time and shortens time to insight.

Running on containers also allows IT to offer a Big-Data-as-a-Service model, because the entire infrastructure can be managed with existing cloud management tools. This simplifies new deployments and reduces the time they take.

Business Challenges & Technology Requirements for IoT


Sunday, April 08, 2018

Data Storage Tiers and HPE Solutions

A data-centered economy means that organizations need to store vast volumes of data, and to store it economically so as to get the best performance/cost ratio. This implies creating multi-tier data storage systems with automated data management, such as HPE DMF solutions.

Automating data management across multiple tiers and building cost-effective SDS systems is the way forward.

Today, HPE offers several SDS solutions for each data storage tier.

Tier-1: VMware vSAN, HPE SimpliVity & HPE VSA
Tier-2: Lustre SDS with ZFS for scale-out NAS storage
Tier-3: Scality RING Object Store
Tier-4: HDD-based RDX Cartridges for long-term off-site secure backup

Tier-1: Frequently accessed storage, best built on SSDs. NVMe SSDs offer the highest IOPS and throughput; SAS SSDs offer the highest capacity while still delivering very high IOPS.

Tier-2: Frequently used data, often files used by individuals. These files are best stored in a scale-out NAS system. HPE offers Lustre with ZFS built on Apollo 4500 servers, which provides highly scalable storage capacity at low cost compared to dedicated NAS arrays.

Tier-3: Archived object store data, used infrequently but retained for business purposes. HPE offers the Scality RING Object Store solution built on Apollo 4500 servers, which provides highly scalable storage capacity at very low cost and does not need backup, as data is replicated across multiple datacenters.

Tier-4: Backup data stored in a secure offline, off-site location. Historically, tape was used for this backup, but with low-cost, highly reliable hard drives, companies can now use HDDs for backup storage. HPE offers FLX HD cartridge solutions for long-term data backup and off-site archival.
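To make the tiering idea concrete, here is a minimal Python sketch of how an automated data management policy might assign a data set to one of the four tiers described above. This is only an illustration, not how HPE DMF works: the DataSet class, the choose_tier function, and the age thresholds (7 and 90 days) are all hypothetical and would need tuning for real data.

```python
from dataclasses import dataclass

# Hypothetical thresholds, in days since last access. Not taken from any HPE product.
TIER_1_MAX_AGE_DAYS = 7
TIER_2_MAX_AGE_DAYS = 90

# Short labels for the four tiers described in the post.
TIER_NAMES = {
    1: "Tier-1: SSD-based primary storage",
    2: "Tier-2: scale-out NAS",
    3: "Tier-3: object store archive",
    4: "Tier-4: offline, off-site backup",
}


@dataclass
class DataSet:
    name: str
    days_since_last_access: int
    is_backup_copy: bool = False


def choose_tier(ds: DataSet) -> int:
    """Return the tier number (1-4) for a data set under the assumed policy."""
    if ds.is_backup_copy:
        return 4  # backup copies always go to offline storage
    if ds.days_since_last_access <= TIER_1_MAX_AGE_DAYS:
        return 1  # hot data
    if ds.days_since_last_access <= TIER_2_MAX_AGE_DAYS:
        return 2  # warm data
    return 3  # cold data, archived


if __name__ == "__main__":
    samples = [
        DataSet("orders_db", 0),
        DataSet("team_share", 30),
        DataSet("logs_2015", 900),
        DataSet("orders_db_backup", 1, is_backup_copy=True),
    ]
    for ds in samples:
        print(ds.name, "->", TIER_NAMES[choose_tier(ds)])
```

A real policy would likely also weigh file size, cost per terabyte, and compliance rules; last-access age is used here only because it is the simplest signal to reason about.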



Thursday, April 05, 2018

Data Preparation Process


Data preparation is the first step in modern data analytics and BI, data science, and data integration.

Data preparation takes more than 60% of the data analytics time. With businesses demanding faster time to insight to remain competitive, analytics is becoming more pervasive across the enterprise, and those insights are being derived from a larger number of diverse data sources, both internal and external to the enterprise, with varying degrees of trustworthiness. This increases complexity.

A well-defined data preparation process reduces time to insight for analytics and is the first step in data analytics.
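As a rough illustration of what a data preparation step can look like, the Python sketch below cleans raw records that might come from several sources: it trims stray whitespace, drops rows that are missing required fields, and removes duplicates. The prepare function and the sample field names (id, city) are hypothetical; real pipelines usually add validation, type conversion, and source-specific rules.

```python
def prepare(records, required_fields):
    """Clean raw records: trim strings, drop incomplete rows, remove duplicates.

    Duplicates are detected on the values of required_fields only.
    """
    seen = set()
    cleaned = []
    for rec in records:
        # Normalize: strip whitespace and treat empty strings as missing.
        row = {}
        for key, value in rec.items():
            if isinstance(value, str):
                value = value.strip()
                if value == "":
                    value = None
            row[key] = value

        # Drop rows that lack any required field.
        if any(row.get(field) is None for field in required_fields):
            continue

        # Skip rows already seen (same values in the required fields).
        identity = tuple(row[field] for field in required_fields)
        if identity in seen:
            continue
        seen.add(identity)
        cleaned.append(row)
    return cleaned


if __name__ == "__main__":
    raw = [
        {"id": " 1 ", "city": "Pune"},
        {"id": "1", "city": "Pune "},   # duplicate after trimming
        {"id": "2", "city": ""},        # missing city
        {"id": "3", "city": "Delhi"},
    ]
    for row in prepare(raw, ["id", "city"]):
        print(row)
```

Doing this cleanup once, up front, means every downstream report or model works from the same trusted records instead of repeating the same fixes.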