Understand The Language Of Data: Strata+Hadoop World and TAP

Federal Cloud Computing Strategy Officially Launched

By G C Network | February 14, 2011

Federal CIO Vivek Kundra officially launched the Federal Cloud Computing Strategy today. While this is clearly not new news, the document does state the government’s position in a very succint manner.…

GEOINT’s Future is in the Cloud

By G C Network | January 31, 2011

Recently, Geospatial Intelligence Forum Magazine asked me for my thoughts on the role of cloud computing in the future of geospatial intelligence.My response was recently published in their December 2010…

eTechSuccess: Patterns of Success – Kevin Jackson

By G C Network | January 27, 2011

 My sincere appreciation to John Baker for the eTechSuccess: Patterns of Success interview. John and I worked together IBM as part of the Wireless Emerging Business Organization. His team and…

USBE&IT Winter Issue Focuses on Cyber Security

By G C Network | January 19, 2011

Thank You USBE&IT Publisher Mr Tyrone Taborn for such an inspiring issue and my sincere appreciation to Mr. Frank McCoy for my inclusion in his list of Cyber visionaries! The Homeland…

Global GovCloud with Cisco and VCE

By G C Network | January 18, 2011

Last week I had the awesome experience of participating in a global telepresence conference on government cloud computing. Joining me as presenters were Blake Salle, Senior Vice President of VCE,…

NIST Cloud Computing Collaboration Twiki Launches

By G C Network | December 30, 2010

Today I received my credentials for the NIST Cloud Computing Collaboration Site. “The National Institute of Standards and Technology (NIST) has been designated by Federal Chief Information Officer Vivek Kundra…

GovCloud Predicitons for 2011

By G C Network | December 30, 2010

Happy New Year All!! 2011 will be the breakout year for GovCloud! Pressure to reduce budget, pressure to manage I resources better and the political pressure of the next presidential…

Vivek Kundra Unveils 25-Point IT Management Reform Program

By G C Network | December 10, 2010

Yesterday the US Federal CIO, Vivek Kundra, unveiled an ambitious 25-point implementation plan for delivering more value to the American taxpayer. This plan focuses on execution and is designedto establish…

GSA and Unisys/Google Marks GovCloud Watershed

By G C Network | December 4, 2010

As widely reported this week, the United States General Services Administration (GSA) has awarded a contract to Unisys to create a secure cloud-based email and collaboration platform. The solution will…

NIST Moves Forward on Cloud Computing

By G C Network | November 8, 2010

Last week the National Institute of Standards and Technology (NIST) held their second Cloud Computing Forum and Workshop. Skillfully shepherded by Ms. Dawn Leaf, the agency’s senior executive of cloud computing,…

Our world is driven by data.  It may speak in whispers, but it can also scream insight and information to those that understand it’s language. This is why I’ll be attending Strata+Hadoop World, Sept 26th to 29th, in New York City.

Even though data can also speak many different languages, data scientist act as our interpreters and guides.  They help us survive and thrive in this data-driven world by addressing and taming the many business challenges it presents, including:
  • An appropriate interpretive language, be it The language itself algebraic notation, an adapted programming language or both;
  • Separating the data signal from the data noise;
  • The enablement of data access and data connectivity within the enterprise;
  • Handling the complexity and variety of complex data which can include images, videos and abstract representations of both the physical and living world;
  • Integration of the time variable into the data interpretation process;
  • Security and protection of the data; and
  • Collaboration with a strong and innovative technology partner.[1]

That last challenge is actually why I’m anxious to learn more about the Trusted Analytics Platform (TAP), open source software optimized to create cloud-native data analytics applications. This multi-tenant platform contains connectors for data ingestion, multiple distributed data stores, advanced processing engines and collaborative analytics capabilities.  It even includes machine learning, model building and visualization within a multi-language application runtime environment. This last feature enables developers and data scientists to use the languages with which they are most familiar. At every layer of the platform, performance optimizations maximize analytic operation speed.  Data security enhancements are also embedded, from the silicon up, to ensure protection of both the data and processing.

Instead of starting from scratch and deploying a host of different tools, packages and services, TAP provides an extensible environment that combines many open-source components into a single, integrated platform.  This integrated architecture provides the APIs, services and extensibility to support the needs of data scientists and application developers for varied analytics on virtually any data, of any size, located anywhere. It also provides management tools and services to control and monitor operations from top to bottom.

TAP also includes a rich marketplace where tools and services can be easily integrated and provisioned on demand. This marketplace is accessible through a simple, browser-based interface to a purpose-built service catalog. Application developers, data scientists and system operators all have the flexibility to choose the tools and services that they need for ingestion, storage or manipulation of data. In addition, system operators can add services to the TAP Marketplace in their instance of TAP, which saves time by eliminating the need to identify and curate key tools and libraries. All of this is done in a secure and collaborative high performance environment. A growing number of organizations support, use and contribute to TAP in order to address many use cases like:

  • Customer behavior analysis using wearable IT systems;
  • Tracking disease progression and treatment;
  • Asset management using RFID data;
  • Equipment failure prediction and optimization using sensor data; and
  • Privacy-preserving genomic analysis using diverse distributed data sets.

Join me in New York next week at Strata+Hadoop World to learn more. To prepare, you can read TAP documentation and code at https://github.com/trustedanalytics, visit their public Jira at https://trustedanalytics.atlassian.netor contact them directly at [email protected].



[1] https://dzone.com/articles/challenges-of-bigdata

( This content is being syndicated through multiple channels. The opinions expressed are solely those of the author and do not represent the views of GovCloud Network, GovCloud Network Partners or any other corporation or organization.)

Cloud Musings

( Thank you. If you enjoyed this article, get free updates by email or RSS – © Copyright Kevin L. Jackson 2015)

Follow me at https://Twitter.com/Kevin_Jackson
Posted in

G C Network