Analytics Blog

Feature Spotlight: Python Execute

Last year we announced the addition of Python Notebooks into the Chorus Platform. This was a long-requested feature which gave teams of data scientists and analysts even more flexibility within Chorus. Notebooks let you easily perform interactive Python analysis on data from any of the sources, Hadoop or database, connected to Chorus. Like Visual Workflows,… Read more »


Spark Summit East 2017: Spark Autotuning with Alpine Data

Last week at Spark Summit East 2017, Alpine Data presented details about technology we have developed for autotuning Spark jobs. Spark can deliver amazing performance allowing data scientists to apply complex machine learning algorithms on large data sets and quickly deliver actionable insights. However, Spark is extremely sensitive to how the Spark job is configured… Read more »


Meet the New Operators in Chorus 6.2

With each release of Chorus, we see product enhancements for usability, integrations, security and more. But there is also continuous growth in the ETL and machine learning algorithms available to users. Many of these quietly slip into the operator list in your Chorus sidebar without notice, so we’d like to take this opportunity to introduce… Read more »


Using Hive to Perform Advanced Analytics in Hadoop

Hadoop data warehouses have continued to gain popularity with solutions such as Hive, Impala and HAWQ now frequently deployed at customer sites. Access to these warehouses is typically tightly controlled using Ranger or Sentry — ensuring comprehensive data security. Due to the ease with which data can be governed in Hive, an increasing number of… Read more »


Deploying Machine Learning to the Cloud

While enterprises have traditionally deployed Hadoop clusters on their data centers, there is a growing number creating clusters in the cloud. Cloud providers such as AWS and GCP make it almost effortless to spin-up and tear-down Hadoop clusters on-demand and provide a cost-effective approach to on-demand big data systems. However, the current analytics solutions offered… Read more »