Analytics | cognitree

Traffic insights in real time using Sankey charts in Kibana

There are many challenges involved in visualizing application traffic patterns. We first need to visualize the sequence of components along various flows of the traffic. Then we need to filter... read more →

Big data stack on Kubernetes

At Cognitree, most of our customer projects involve analysing a large amount of data to gather insights. These projects involve design and development of ETL pipelines, real-time analysis, report generation... read more →

Time series data management in Elasticsearch

In this blog post, we’d like to outline how we defined policies for time series data management in Elasticsearch. Background As part of an IoT security solution built for a... read more →

User defined aggregate functions (UDAF) in Spark

Apart from the pre-built functions available for data analysis, Spark enables developers to write custom user defined functions that can be applied on a single row, a group of rows... read more →

Kronos – a cron replacement to schedule complex data workflows

Increasing need for insights from vast data sources has given rise to data-driven business intelligence products which build and execute complex data workflows. A data workflow is a set of... read more →

Flume sink plugin for Elasticsearch 6.x

Version 2.0.0 of Flume sink plugin is now available. The release includes support for Flume 1.8.0 and Elasticsearch 6.2.4. Many thanks to Alexey Mikka for his contributions. Please use Cognitree's... read more →

Classloaders for Apache Flume plugins in 1.8.0

Last year, Cognitree enhanced Apache Flume 1.7.0 with a support for using classloaders to load plugins. We have now ported the support for users of Flume 1.8.0. The binary distribution... read more →

Flume sink plugin for Elasticsearch 5.x

Cognitree has open sourced a Flume sink plugin for Elasticsearch 5.4. The sink plugin is compatible with Flume version 1.7. To avoid dealing with versioning hell for dependencies we highly... read more →

Classloaders for Apache Flume plugins

Apache Flume is a tool for moving large amounts of data from various sources to a centralized data store. It provides an extensible framework to expand its applicability to various... read more →

Hive on HBase

Hive provides insights into the data present in HBase (and HDFS) by responding to ad hoc queries. Queries can be written in HQL(Hive Query Language) which are sql like. Hive... read more →

Categories