Back to Browse

Apache Druid Adoption – Shape Incoming Data Effectively

194 views
Sep 29, 2022
17:35

To build any application, engineers need to prepare the data they want to surface, making it ready for the chosen database and ready for their users. Peter Marshall, Director of Imply’s Developer Relations team, shares community wisdom around making data ready for Apache Druid® ingestion and query, signposting you to need-to-know techniques for delivering modern analytics applications powered by Druid. Get hands-on training for free at: https://learn.imply.io/ Learn more: Using DimensionsSpec for inclusions, exclusions, and schemaless ingestion: https://druid.apache.org/docs/latest/ingestion/ingestion-spec.html#dimensionsspec TransformsSpec for expressions and filtering: https://druid.apache.org/docs/latest/ingestion/ingestion-spec.html#transformspec Using DimensionsSpec for indexes: https://druid.apache.org/docs/latest/ingestion/ingestion-spec.html#dimensionsspec Clustering data with partitioning: https://druid.apache.org/docs/latest/ingestion/partitioning.html Read about compaction: https://druid.apache.org/docs/latest/ingestion/compaction.html Read about rollup: https://druid.apache.org/docs/latest/ingestion/rollup.html Read about sketch functions: https://druid.apache.org/docs/latest/querying/sql-scalar.html#sketch-functions SQL JOIN operations in Druid: https://druid.apache.org/docs/latest/querying/joins.html Read about updating data: https://druid.apache.org/docs/latest/ingestion/data-management.html#updating-existing-data Connect: Join the Apache Druid workspace on Slack: https://druid.apache.org/community/join-slack Subscribe: https://www.youtube.com/c/Implydata Apache Druid GitHub: https://github.com/apache/druid Twitter: https://twitter.com/implydata LinkedIn: https://www.linkedin.com/company/imply/ About Imply Developers are in the driver’s seat when it comes to analytics, building applications that serve real-time insights on terabytes to petabytes of streaming and batch data at hundreds to thousands of queries per second. With Imply, developers have a database that is uniquely built for these analytics applications, delivering sub-second queries at scale and under load. The result? No spinning wheel and no limit to the analytics in their applications. Check us out at https://imply.io/

Download

1 formats

Video Formats

360pmp427.4 MB

Right-click 'Download' and select 'Save Link As' if the file opens in a new tab.

Apache Druid Adoption – Shape Incoming Data Effectively | NatokHD