Friday, April 13, 2018

Better Way to Manage Big Data with Containers

Big Data analytics using Hadoop & Mapreduce is best done in containers instead of bare metal servers and dedicated servers. Running big data analytics on containers using Kubernetes is faster, and costs less.

Storing data in Software Defined Storage systems (using HDFS over Object Store) allows for shared storage and also allows for in-place analytics, this is more efficient than copying all the files over to local disks - thus saving time & getting faster time to insights.

Also running on containers, allows IT to offer BigData-as-a-Service model as the entire infrastructure can be managed with existing cloud management tools, and this simplifies & reduces time for new  deployments.

No comments: