Ranked #1
Episode 145 – Alex Zeltov on MLOps with mlflow, kubeflow and other tools (part 1)
Episode 145 – Alex Zeltov on MLOps with mlflow, kubeflow and other tools (part 1)
In this episode, Global Black Belt and Technical Architect in Big Data and Advanced Analytics Team at Microsoft, Alex Ze... Read more
18 Jun 2019
•
45mins
Ranked #2
Episode 121 – Infrastructure and Data Lifecycle (part 1)
Episode 121 – Infrastructure and Data Lifecycle (part 1)
Does the standard Dev-Test-Prod cycle make sense in a Big Data environment or should you approach this subject a little ... Read more
1 Jan 2019
•
42mins
Ranked #3
Episode 101 – Apache Pulsar update with Matteo and Sijie from Streamlio
Episode 101 – Apache Pulsar update with Matteo and Sijie from Streamlio
Matteo and Sijie from Streamlio reached out to us and let us know they had an update on Apache Pulsar. It turned out the... Read more
14 Aug 2018
•
1hr 5mins
Ranked #4
Episode 65 – Roaring news
Episode 65 – Roaring news
It's another Roaring News episode. Today Jhon talks about machine learning projects for beginners, data visualization an... Read more
12 Dec 2017
•
36mins
Ranked #5
Episode 155 – NoSQL: You keep using that word…
Episode 155 – NoSQL: You keep using that word…
For a podcast on Big Data, we were amazed that we never covered the subject of NoSQL. So we're correcting this today. No... Read more
27 Aug 2019
•
37mins
Ranked #6
Episode 147 – Alex Zeltov on MLOps with mlflow, kubeflow and other tools (part 2)
Episode 147 – Alex Zeltov on MLOps with mlflow, kubeflow and other tools (part 2)
In this episode, Global Black Belt and Technical Architect in Big Data and Advanced Analytics Team at Microsoft, Alex Ze... Read more
2 Jul 2019
•
44mins
Ranked #7
Episode 78 – Apache Trafodion transactional SQL for Hadoop (Part 2)
Episode 78 – Apache Trafodion transactional SQL for Hadoop (Part 2)
This episode, a group of people from Esgyn join us to talk about the Apache Trafodion transactional SQL for Hadoop datab... Read more
13 Mar 2018
•
1hr 4mins
Ranked #8
Episode 107 – Open Metadata and Governance Masterclass with Mandy Chessell – Part 1
Episode 107 – Open Metadata and Governance Masterclass with Mandy Chessell – Part 1
In this GDPR world, Data Governance and Data Lineage are, or should be, very much top of mind for anybody in the Big Dat... Read more
25 Sep 2018
•
41mins
Ranked #9
Episode 125 – Sparkling Water with H2O.AI (Part 1)
Episode 125 – Sparkling Water with H2O.AI (Part 1)
We recently sat down with Kuba and Pavel from H2O to discuss how you can easily lift your Spark notebooks to the next le... Read more
29 Jan 2019
•
51mins
Ranked #10
Episode 165 – Best Practices for Machine Learning
Episode 165 – Best Practices for Machine Learning
A little while ago we came across a blog by Martin Zinkevich about the best practices for ML Engineering at Google. We h... Read more
5 Nov 2019
•
31mins
Ranked #11
Episode 87 – Druid: a high-performance, column-oriented, distributed data store – part 2
Episode 87 – Druid: a high-performance, column-oriented, distributed data store – part 2
This is the second part of an interview with Fangjin Yang, co-founder and CEO at Imply and committer/PMC member for the ... Read more
8 May 2018
•
31mins
Ranked #12
Episode 74 – Hadoop sizing part 3: Compute sizing
Episode 74 – Hadoop sizing part 3: Compute sizing
As promised, in this final part of our Hadoop Sizing series, we round off the subject with sizing your compute and netwo... Read more
13 Feb 2018
•
49mins
Ranked #13
Episode 68 – Future Predictions
Episode 68 – Future Predictions
Welcome to 2018! And welcome to our 110% fact based prediction show for 2018. As you may expect from your two hosts, eve... Read more
2 Jan 2018
•
48mins
Ranked #14
Episode 137 – Interview on DataOps with Chris Bergh of DataKitchen.io (Part 1)
Episode 137 – Interview on DataOps with Chris Bergh of DataKitchen.io (Part 1)
DataKitchen.io's Chris Bergh takes us down the path towards successful DataOps implementation.If you have not heard of t... Read more
23 Apr 2019
•
45mins