Understand The Language Of Data: Strata+Hadoop World and TAP

A group of people standing in front of a large screen.

2018 AT&T Business Summit: Security “in” and “of” the Cloud

By G C Network | November 10, 2018

While public cloud is undoubtedly an outsized piece of the conversation, news headlines of the latest data breach can make this move a very frightening proposition. The question of how…

A group of people standing in front of a display.

My Brush with Royalty: Queen Latifah

By G C Network | November 9, 2018

Queen Latifah! Hip Hop Icon. Movie Star. Television Star. Fashion Model. Songwriter. Producer. Entrepreneurial Genius!? YES! Dana Elaine Owens, her given name, is co-owner of Flavor Unit Entertainment, a firm that includes…

What’s New in Puppet 5?

By G C Network | November 9, 2018

Puppet 5 is released and comes with several exciting enhancements and features that promise to make configuration management much more streamlined. This article will take a comprehensive look at these…

5 Reasons Why Ansible is the Best CM Tool Out There?

By G C Network | November 9, 2018

Amidst volatile markets, dynamic technology shifts, and ever-increasing customer demands, it is imperative for IT organizations to develop flexible, scalable and high-quality applications that exceed expectations and enhance productivity. A…

Machine learning APIs for Google Cloud Platform

By G C Network | November 9, 2018

Google Cloud Platform (GCP) is considered to be one of the Big 3 cloud platforms among Microsoft Azure and AWS. GCP is widely used cloud solutions supporting AI capabilities to design and…

What Is The Most Important Part of Architecture?

By G C Network | November 9, 2018

I always find it interesting to hear what people view architecture as. A lot of people think it’s just about the design aspect, where you get to put pen to…

Cloud migration best practice Part 4: Executing the migration

By G C Network | September 16, 2018

This series has stepped through cloud migration best practices. After providing an overview, we discussed: Classifying business-critical data. Updating organizational IT governance policies. Application screening and cloud workload selection. How to…

Cloud Migration Best Practice Part 3: Application Portfolio Analysis

By G C Network | September 16, 2018

In part three of this series on cloud migration best practice, I will focus on migrating the application itself. If you haven’t had the opportunity to read our recommendations from part…

Why Use Immutable Storage?

By G C Network | September 11, 2018

Data has become a global currency, and its value has nowhere to go but up.  According to The Economist online, the world’s most valuable resource is no longer oil, but…

CLOUD ACT: What Does That Mean for Your Cloud Storage

By G C Network | August 31, 2018

When Congress names a law after you, it’s getting serious.  That is where we are now with cloud computing.  The Clarifying Lawful Overseas Use of Data Act or CLOUD Act (H.R. 4943) is…

Our world is driven by data.  It may speak in whispers, but it can also scream insight and information to those that understand it’s language. This is why I’ll be attending Strata+Hadoop World, Sept 26th to 29th, in New York City.

Even though data can also speak many different languages, data scientist act as our interpreters and guides.  They help us survive and thrive in this data-driven world by addressing and taming the many business challenges it presents, including:
  • An appropriate interpretive language, be it The language itself algebraic notation, an adapted programming language or both;
  • Separating the data signal from the data noise;
  • The enablement of data access and data connectivity within the enterprise;
  • Handling the complexity and variety of complex data which can include images, videos and abstract representations of both the physical and living world;
  • Integration of the time variable into the data interpretation process;
  • Security and protection of the data; and
  • Collaboration with a strong and innovative technology partner.[1]

That last challenge is actually why I’m anxious to learn more about the Trusted Analytics Platform (TAP), open source software optimized to create cloud-native data analytics applications. This multi-tenant platform contains connectors for data ingestion, multiple distributed data stores, advanced processing engines and collaborative analytics capabilities.  It even includes machine learning, model building and visualization within a multi-language application runtime environment. This last feature enables developers and data scientists to use the languages with which they are most familiar. At every layer of the platform, performance optimizations maximize analytic operation speed.  Data security enhancements are also embedded, from the silicon up, to ensure protection of both the data and processing.

Instead of starting from scratch and deploying a host of different tools, packages and services, TAP provides an extensible environment that combines many open-source components into a single, integrated platform.  This integrated architecture provides the APIs, services and extensibility to support the needs of data scientists and application developers for varied analytics on virtually any data, of any size, located anywhere. It also provides management tools and services to control and monitor operations from top to bottom.

TAP also includes a rich marketplace where tools and services can be easily integrated and provisioned on demand. This marketplace is accessible through a simple, browser-based interface to a purpose-built service catalog. Application developers, data scientists and system operators all have the flexibility to choose the tools and services that they need for ingestion, storage or manipulation of data. In addition, system operators can add services to the TAP Marketplace in their instance of TAP, which saves time by eliminating the need to identify and curate key tools and libraries. All of this is done in a secure and collaborative high performance environment. A growing number of organizations support, use and contribute to TAP in order to address many use cases like:

  • Customer behavior analysis using wearable IT systems;
  • Tracking disease progression and treatment;
  • Asset management using RFID data;
  • Equipment failure prediction and optimization using sensor data; and
  • Privacy-preserving genomic analysis using diverse distributed data sets.

Join me in New York next week at Strata+Hadoop World to learn more. To prepare, you can read TAP documentation and code at https://github.com/trustedanalytics, visit their public Jira at https://trustedanalytics.atlassian.netor contact them directly at trustedanalytics@gmail.com.



[1] https://dzone.com/articles/challenges-of-bigdata

( This content is being syndicated through multiple channels. The opinions expressed are solely those of the author and do not represent the views of GovCloud Network, GovCloud Network Partners or any other corporation or organization.)

Cloud Musings

( Thank you. If you enjoyed this article, get free updates by email or RSS – © Copyright Kevin L. Jackson 2015)

Follow me at https://Twitter.com/Kevin_Jackson
Posted in

G C Network