Understand The Language Of Data: Strata+Hadoop World and TAP

Our world is driven by data.  It may speak in whispers, but it can also scream insight and information to those who understand its language. This is why I’ll be attending Strata+Hadoop World, Sept 26th to 29th, in New York City.

Even though data can speak many different languages, data scientists act as our interpreters and guides.  They help us survive and thrive in this data-driven world by addressing and taming the many business challenges it presents, including:
  • Choosing an appropriate interpretive language, be it algebraic notation, an adapted programming language or both;
  • Separating the data signal from the data noise;
  • The enablement of data access and data connectivity within the enterprise;
  • Handling the complexity and variety of data, which can include images, videos and abstract representations of both the physical and living world;
  • Integration of the time variable into the data interpretation process;
  • Security and protection of the data; and
  • Collaboration with a strong and innovative technology partner.[1]

That last challenge is actually why I’m anxious to learn more about the Trusted Analytics Platform (TAP), open source software optimized to create cloud-native data analytics applications. This multi-tenant platform contains connectors for data ingestion, multiple distributed data stores, advanced processing engines and collaborative analytics capabilities.  It even includes machine learning, model building and visualization within a multi-language application runtime environment. This last feature enables developers and data scientists to use the languages with which they are most familiar. At every layer of the platform, performance optimizations maximize analytic operation speed.  Data security enhancements are also embedded, from the silicon up, to ensure protection of both the data and processing.

Instead of starting from scratch and deploying a host of different tools, packages and services, TAP provides an extensible environment that combines many open-source components into a single, integrated platform.  This integrated architecture provides the APIs, services and extensibility to support the needs of data scientists and application developers for varied analytics on virtually any data, of any size, located anywhere. It also provides management tools and services to control and monitor operations from top to bottom.

TAP also includes a rich marketplace where tools and services can be easily integrated and provisioned on demand. This marketplace is accessible through a simple, browser-based interface to a purpose-built service catalog. Application developers, data scientists and system operators all have the flexibility to choose the tools and services that they need for ingestion, storage or manipulation of data. In addition, system operators can add services to the TAP Marketplace in their instance of TAP, which saves time by eliminating the need to identify and curate key tools and libraries. All of this is done in a secure and collaborative high-performance environment. A growing number of organizations support, use and contribute to TAP to address use cases such as:

  • Customer behavior analysis using wearable IT systems;
  • Tracking disease progression and treatment;
  • Asset management using RFID data;
  • Equipment failure prediction and optimization using sensor data; and
  • Privacy-preserving genomic analysis using diverse distributed data sets.

Join me in New York next week at Strata+Hadoop World to learn more. To prepare, you can read TAP documentation and code at https://github.com/trustedanalytics, visit their public Jira at https://trustedanalytics.atlassian.net or contact them directly at [email protected].



[1] https://dzone.com/articles/challenges-of-bigdata

(This content is being syndicated through multiple channels. The opinions expressed are solely those of the author and do not represent the views of GovCloud Network, GovCloud Network Partners or any other corporation or organization.)

Cloud Musings

(Thank you. If you enjoyed this article, get free updates by email or RSS – © Copyright Kevin L. Jackson 2015)

Follow me at https://Twitter.com/Kevin_Jackson