Understand The Language Of Data: Strata+Hadoop World and TAP

Cloud computing: A data-centric business model

By G C Network | October 3, 2015

According to the National Institute of Standards and Technology: “Cloud computing is a model for enabling ubiquitous, convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers,…

John Mayer At Dell World 2015!! (Oh, I’ll be there too.)

By G C Network | September 30, 2015

An artist who defies all boundaries, John Mayer has won seven Grammy Awards and sold more than 17 million albums worldwide. The singer, songwriter and guitarist’s skills have been widely…

Data-centric Security: The New Must Have

By G C Network | September 23, 2015

Where is your data right now? The explosion of cloud computing and consumer IT means that your data, as well as data about you, can be virtually anywhere.Having your data and the…

Personal email:Pathway to Cybersecurity Breaches

By G C Network | September 14, 2015

As a business communications tool, email is the dominant option, and many corporations have policies that allow the use of personal email on corporate computers. In a recent Adobe Systems…

IEEE Cloud Computing: Legal Clouds

By G C Network | September 11, 2015

The new issue of IEEE Cloud Computing is now available!   This special issue looks at how to balance privacy with legitimate surveillance and lawful data access. Some of the…

Cloud hosting: Look beyond cost savings and weigh pros, cons

By G C Network | September 3, 2015

Is your company struggling with the idea of using “cloud hosting” in order to save money? Truth be known, using cost savings as the primary reason for moving to cloud…

“Cloud First” Lessons Learned from ViON

By G C Network | August 25, 2015

In 2011, then United States CIO Vivek Kundra released the US Federal Cloud Computing Strategy [1]. In the executive summary he pointed to cloud computing as a key component of…

Looking for Security Peak Performance?

By G C Network | August 19, 2015

You can find it at Dell Peak Performance 2015!!! I’ll be there at the Aria Resort and Casino in Las Vegas attending as a social media correspondent with a full…

The Cybersecurity Sprint: Are we safe yet?

By G C Network | August 7, 2015

UPDATE: NBC News reports U.S. officials have disclosed a hack of the Pentagon’s Joint Staff unclassified email system, which took place on July 25. Recent unauthorized access to a U.S. government database…

Cloud Computing + Things = “Information Excellence”, Not IoT

By G C Network | July 31, 2015

The Internet of Things (IoT) has quickly become the next “be all to end all” in information technology. Touted as how cloud computing will connect everyday things together, it is…

Our world is driven by data.  It may speak in whispers, but it can also scream insight and information to those that understand it’s language. This is why I’ll be attending Strata+Hadoop World, Sept 26th to 29th, in New York City.

Even though data can also speak many different languages, data scientist act as our interpreters and guides.  They help us survive and thrive in this data-driven world by addressing and taming the many business challenges it presents, including:
  • An appropriate interpretive language, be it The language itself algebraic notation, an adapted programming language or both;
  • Separating the data signal from the data noise;
  • The enablement of data access and data connectivity within the enterprise;
  • Handling the complexity and variety of complex data which can include images, videos and abstract representations of both the physical and living world;
  • Integration of the time variable into the data interpretation process;
  • Security and protection of the data; and
  • Collaboration with a strong and innovative technology partner.[1]

That last challenge is actually why I’m anxious to learn more about the Trusted Analytics Platform (TAP), open source software optimized to create cloud-native data analytics applications. This multi-tenant platform contains connectors for data ingestion, multiple distributed data stores, advanced processing engines and collaborative analytics capabilities.  It even includes machine learning, model building and visualization within a multi-language application runtime environment. This last feature enables developers and data scientists to use the languages with which they are most familiar. At every layer of the platform, performance optimizations maximize analytic operation speed.  Data security enhancements are also embedded, from the silicon up, to ensure protection of both the data and processing.

Instead of starting from scratch and deploying a host of different tools, packages and services, TAP provides an extensible environment that combines many open-source components into a single, integrated platform.  This integrated architecture provides the APIs, services and extensibility to support the needs of data scientists and application developers for varied analytics on virtually any data, of any size, located anywhere. It also provides management tools and services to control and monitor operations from top to bottom.

TAP also includes a rich marketplace where tools and services can be easily integrated and provisioned on demand. This marketplace is accessible through a simple, browser-based interface to a purpose-built service catalog. Application developers, data scientists and system operators all have the flexibility to choose the tools and services that they need for ingestion, storage or manipulation of data. In addition, system operators can add services to the TAP Marketplace in their instance of TAP, which saves time by eliminating the need to identify and curate key tools and libraries. All of this is done in a secure and collaborative high performance environment. A growing number of organizations support, use and contribute to TAP in order to address many use cases like:

  • Customer behavior analysis using wearable IT systems;
  • Tracking disease progression and treatment;
  • Asset management using RFID data;
  • Equipment failure prediction and optimization using sensor data; and
  • Privacy-preserving genomic analysis using diverse distributed data sets.

Join me in New York next week at Strata+Hadoop World to learn more. To prepare, you can read TAP documentation and code at https://github.com/trustedanalytics, visit their public Jira at https://trustedanalytics.atlassian.netor contact them directly at [email protected].



[1] https://dzone.com/articles/challenges-of-bigdata

( This content is being syndicated through multiple channels. The opinions expressed are solely those of the author and do not represent the views of GovCloud Network, GovCloud Network Partners or any other corporation or organization.)

Cloud Musings

( Thank you. If you enjoyed this article, get free updates by email or RSS – © Copyright Kevin L. Jackson 2015)

Follow me at https://Twitter.com/Kevin_Jackson
Posted in

G C Network