Главная
Study mode:
on
You
History
Saved
In progress
0 courses
compleat
0 courses
#Art & Design
#Adobe
#ChatGPT
#GitHub
#Data Pipelines
Showing:
318
courses
Sort by Relevancy
Highest rated
Lowest rated
Most recently added
NPTEL
Practical Machine Learning with Tensorflow
1
rewiews
We will cover the basics of Tensorflow and Machine Learning in the initial sessions and advanced topics in the latter part. After this course, the students will be able to build ML models using Tensorflow.
Add to list
34
Lesons
16 hours
On-Demand
Free-Video
MLOps.community
Implementing Data Capture for ML Observability and Drift Detection
0
rewiews
Explore the implementation of data capture for machine learning observability and drift detection in this 24-minute conference talk by Pushkar Garg. Dive into the complexities of modern ML systems, including data pipelines and transformations across mult…
Add to list
1
Lesons
24 minutes
On-Demand
Free-Video
MLOps.community
Going Beyond Two-Tier Data Architectures with DuckDB
0
rewiews
Explore the innovative data architectures enabled by DuckDB, an in-process analytical data management system, in this 32-minute talk by Prof. Dr. Hannes Mühleisen. Discover how DuckDB's lightweight yet powerful design, available under the MIT license, al…
Add to list
1
Lesons
32 minutes
On-Demand
Free-Video
MLOps.community
Reproducible Data Science Over Data Lakes
0
rewiews
Explore reproducible data science techniques for data lakes in this 12-minute conference talk by Ciro Greco, presented at DE4AI. Delve into the challenges of achieving reproducibility in Lakehouse architectures and discover recent advancements made at Ba…
Add to list
1
Lesons
12 minutes
On-Demand
Free-Video
MLOps.community
The Evolution of Lyft's ML Feature Store
0
rewiews
Explore the evolution of Lyft's machine learning feature store in this 13-minute conference talk presented by Devon Mittow, a Staff Data Engineer with over a decade of experience in data engineering across various industries. Gain insights into how Lyft'…
Add to list
1
Lesons
13 minutes
On-Demand
Free-Video
CodeWithYu
Realtime Streaming for Anomaly Detection - An End-to-End Data Engineering Project
0
rewiews
Build a real-time anomaly detection system using streaming data engineering techniques. Learn end-to-end implementation for monitoring and identifying unusual patterns in data streams.
Add to list
1
Lesons
1 hour 6 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Building Scalable ML Services for Rapid Development in Health and Wellness
0
rewiews
Discover how to build and scale ML services for health and wellness products, reducing release time by 85% using open-source tools like DBT, AirFlow, and MLFlow.
Add to list
1
Lesons
27 minutes
On-Demand
Free-Video
Dynatrace
Anomaly Detection on 5 Pillars of Data Observability with Dynatrace Davis AI
0
rewiews
Explore Dynatrace Davis AI's anomaly detection for data observability pillars: freshness, distribution, volume, schema, and lineage. Learn automation techniques for alerting and monitoring data integrity.
Add to list
8
Lesons
34 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
From Idea to Production: AI Infrastructure for Scaling LLM Applications
0
rewiews
Explore strategies for scaling LLM applications from beta to production, addressing challenges and building adaptable AI infrastructure for evolving models and workflows.
Add to list
1
Lesons
38 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Building Real Time ML Pipelines with a Feature Store
0
rewiews
Explore real-time ML pipeline construction using feature stores. Learn to optimize compute efficiency and enhance ML workflows for production environments.
Add to list
1
Lesons
31 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Lessons Learned from DAG-Based Workflow Orchestration
0
rewiews
Explore DAG-based workflow orchestration challenges and learn about Prefect Orion's DAG-less system for enhanced runtime flexibility and developer experience in data pipelines.
Add to list
1
Lesons
39 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Building Real-Time ML Features with a Feature Platform
0
rewiews
Explore challenges in deploying ML pipelines and learn how feature stores and platforms solve data problems for production ML, enabling real-time processing and scaling.
Add to list
1
Lesons
34 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
The Critical Things You Have to Build to Transform Your Company to be ML Driven
0
rewiews
Discover key components for transforming your company into an ML-driven organization. Learn strategies for overcoming challenges in the machine learning development lifecycle and deploying models effectively.
Add to list
1
Lesons
39 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Airflow: Where Data Engineers and ML Engineers Meet
0
rewiews
Explore Apache Airflow's role in operational machine learning with a real-world demo and reference implementation of GenAI in production.
Add to list
1
Lesons
10 minutes
On-Demand
Free-Video
The Machine Learning Engineer
GraphRAG - Unleashing the Power of Knowledge Graphs with LLMs
0
rewiews
Explore GraphRAG: AI-powered content interpretation using LLMs to create knowledge graphs from unstructured text, enabling advanced search and question-answering capabilities.
Add to list
1
Lesons
1 hour 1 minute
On-Demand
Free-Video
Anyscale
ByteDance's Platform for Reinforcement Learning from Human Feedback
0
rewiews
Explore ByteDance's journey in building a video data processing pipeline using Ray's ecosystem for creating realistic video generation models from text instructions.
Add to list
1
Lesons
32 minutes
On-Demand
Free-Video
Apache Airflow Tutorials
0
rewiews
Learn Apache Airflow fundamentals, set up environments, write pipelines, and integrate with Google Cloud services. Gain practical skills for efficient workflow automation and data pipeline management.
Add to list
7
Lesons
2 hours 30 minutes
On-Demand
Free-Video
Google Cloud Platform Beginner Series
0
rewiews
Comprehensive introduction to Google Cloud Platform, covering key technologies, hands-on exercises, and practical applications for beginners in cloud computing.
Add to list
18
Lesons
6 hours
On-Demand
Free-Video
Confluent
Building Data Pipelines with Apache Kafka and Confluent
0
rewiews
Learn to build streaming data pipelines using Apache Kafka® and Confluent. Covers data ingestion, real-time transformation with ksqlDB, and data egress. Hands-on tutorials included for practical experience.
Add to list
13
Lesons
1 hour 30 minutes
On-Demand
Free-Video
Amigoscode
Kafka Tutorial - Spring Boot Microservices
0
rewiews
Learn to integrate Apache Kafka with Spring Boot microservices, covering topics, producers, consumers, and building a RESTful API that interacts with the Kafka ecosystem.
Add to list
14
Lesons
51 minutes
On-Demand
Free-Video
Microsoft
Azure Data Factory Power Hour - Latest Updates and Demos
0
rewiews
Exclusive Azure Data Factory updates and demos, including Power Query integration, managed virtual networks, Teams notifications, and user-assigned managed identities for enhanced data integration capabilities.
Add to list
8
Lesons
1 hour 1 minute
On-Demand
Free-Video
Microsoft
Smart Data Pipelines to Azure - Ingesting and Migrating Data the DataOps Way
0
rewiews
Explore StreamSets' DataOps platform for optimizing data ingestion and migration to Azure. Learn about smart data pipelines, integration with Azure services, and how to speed up cloud adoption using modern data engineering practices.
Add to list
12
Lesons
30 minutes
On-Demand
Free-Video
Anyscale
Building an Agentic Framework for Generative AI Applications
0
rewiews
Discover how to build an agentic framework for GenAI applications, focusing on RAG-centric approaches, multilingual queries, and real-time data processing for enhanced customer assistance.
Add to list
1
Lesons
32 minutes
On-Demand
Free-Video
The ASF
Cloud-Native Solutions and Practices for Clickstream Data Analysis
0
rewiews
Construye un sistema de análisis de datos de flujo autónomo y rentable utilizando prácticas nativas de la nube y Apache Kafka para optimizar productos y potenciar el crecimiento empresarial.
Add to list
1
Lesons
36 minutes
On-Demand
Free-Video
Data Science Festival
The Challenges of Engineering Data Science
0
rewiews
Explore the unique challenges and perspectives of engineering in data science through insights from industry experts.
Add to list
1
Lesons
17 minutes
On-Demand
Free-Video
CMU Database Group
Accelerating Data and AI with Spice.ai Open-Source Software
0
rewiews
Discover how Spice.ai's open-source software accelerates data and AI development, exploring innovative approaches to streamline machine learning workflows and enhance database integration.
Add to list
1
Lesons
1 hour 6 minutes
On-Demand
Free-Video
OSACon
ETL with Meltano and Singer in the LLM Era
0
rewiews
Explore how traditional ETL tools like Singer and Meltano can enhance data pipeline management for LLM applications, addressing production-level challenges beyond basic LangChain capabilities.
Add to list
1
Lesons
30 minutes
On-Demand
Free-Video
OSACon
Real-Time Revolution: Kickstarting Your Journey in Streaming Data
0
rewiews
Dive into stream processing fundamentals and learn to build real-time data applications using Python's Bytewax framework, debunking common misconceptions along the way.
Add to list
1
Lesons
24 minutes
On-Demand
Free-Video
OSACon
Building a ChatGPT Data Pipeline with RisingWave Stream Processor and Astra Vector Search
0
rewiews
Discover how to build real-time GenAI pipelines by combining RisingWave's stream processing with Astra's vector embedding and similarity search capabilities for ChatGPT applications.
Add to list
1
Lesons
27 minutes
On-Demand
Free-Video
OSACon
The Real Modern Data Stack - Building ETL Pipelines with Open Source Tools
0
rewiews
Discover how to build powerful, cost-effective ETL data stacks using open-source tools like sling, dbt, duckdb, and dagster - transforming months of implementation into rapid deployment.
Add to list
1
Lesons
21 minutes
On-Demand
Free-Video
Open Data Science
ODSC West 2015 - Cloud Native Data Science
0
rewiews
Cloud-native design principles for data science applications: creating scalable, stateless models easily deployable in modern cloud systems. Insights on open-source platforms and best practices.
Add to list
13
Lesons
27 minutes
On-Demand
Free-Video
Databricks
Funnel Analysis with Apache Spark and Druid for Advertising Campaign Effectiveness
0
rewiews
Learn to perform funnel analysis at scale using Apache Spark, Druid, and DataSketches. Discover techniques for measuring campaign effectiveness and analyzing user behavior in chronological order for large-scale advertising campaigns.
Add to list
24
Lesons
26 minutes
On-Demand
Free-Video
Databricks
Introduction, Principles and Origin of Dagster
0
rewiews
Introduction to Dagster: A data orchestrator for the entire application lifecycle. Covers principles, development, deployment, and monitoring. Includes demo and code snippets showcasing Dagit UI and Dagster programming model.
Add to list
11
Lesons
30 minutes
On-Demand
Free-Video
Databricks
Empowering Developers with Self-Service ETL - Zillow's Approach
0
rewiews
Zillow's self-service ETL solutions empower teams to build, maintain, and monitor data pipelines, abstracting complex processes and catering to diverse user needs while leveraging internal services for efficient implementation.
Add to list
17
Lesons
24 minutes
On-Demand
Free-Video
Databricks
Code Once, Use Often - Declarative Data Pipelines
0
rewiews
Learn to build efficient, reusable data pipelines using declarative approaches. Explore techniques for reducing food waste through data-driven solutions and automation in this developer-focused session.
Add to list
21
Lesons
28 minutes
On-Demand
Free-Video
Databricks
Advancing GPU Analytics with RAPIDS Accelerator for Apache Spark and Alluxio
0
rewiews
Learn to accelerate GPU analytics using RAPIDS for Apache Spark and Alluxio, enabling faster data access and processing for AI and analytics workloads across any cloud environment.
Add to list
15
Lesons
27 minutes
On-Demand
Free-Video
Databricks
Make Reliable ETL Easy on Delta Lake
0
rewiews
Learn to simplify ETL development on Delta Lake, improving data quality and scaling operations. Discover how Databricks streamlines the ETL lifecycle for efficient data engineering and management.
Add to list
14
Lesons
45 minutes
On-Demand
Free-Video
Databricks
Delta Lake Streaming - Internals and Query Progress Logs
0
rewiews
Explore Delta Lake streaming internals, focusing on structured streaming components, Query Progress Logs, and checkpoint directories for efficient real-time data pipeline management.
Add to list
9
Lesons
29 minutes
On-Demand
Free-Video
Databricks
Building Data Quality Pipelines with Apache Spark and Delta Lake
0
rewiews
Learn to build robust data quality pipelines using Apache Spark and Delta Lake. Explore rule templates, reporting models, and PowerBI integration for effective data remediation and self-healing in enterprise environments.
Add to list
8
Lesons
27 minutes
On-Demand
Free-Video
Databricks
Building End-to-End Delta Pipelines on GCP
0
rewiews
Explore building end-to-end Delta pipelines on GCP, covering data reliability, performance, and the Bronze-Silver-Gold architecture pattern, with practical code examples and Big Query integration.
Add to list
11
Lesons
27 minutes
On-Demand
Free-Video
Databricks
Engagement Activity Delta Lake for Einstein Analytics and Sales Cloud Einstein
0
rewiews
Explore how Salesforce built an engagement activity platform using Delta Lake to support Einstein Analytics and Sales Cloud Einstein, covering data ingestion, incremental reads, and mutation handling.
Add to list
26
Lesons
51 minutes
On-Demand
Free-Video
Databricks
Getting Started with Apache Spark on Kubernetes
0
rewiews
Learn to deploy and optimize Apache Spark on Kubernetes with hands-on demos, covering environment setup, application sizing, performance tuning, and monitoring for efficient data pipeline management.
Add to list
10
Lesons
26 minutes
On-Demand
Free-Video
Databricks
Fugue: Unifying Big Data Analytics Ecosystems for ETL and Machine Learning
0
rewiews
Unifying big data ecosystems with Fugue: SQL-like framework for ETL and ML pipelines, compatible with Spark, TensorFlow, and more. Simplifies development, improves performance, and enhances maintainability.
Add to list
13
Lesons
22 minutes
On-Demand
Free-Video
Databricks
From Idea to Model: Productionizing Data Pipelines with Apache Airflow
0
rewiews
Learn to productionize data pipelines using Apache Airflow, bridging data science and engineering. Explore collaborative workflows, custom operators, and scalable solutions for efficient model deployment and iteration.
Add to list
17
Lesons
22 minutes
On-Demand
Free-Video
Databricks
Observability for Data Pipelines with OpenLineage
0
rewiews
Explore data pipeline observability using OpenLineage, enhancing reliability and auditability. Learn how metadata collection enables understanding of data flow and dependencies across teams and technologies.
Add to list
5
Lesons
24 minutes
On-Demand
Free-Video
Databricks
Dumb-Proofing Data Pipelines: Techniques for Configurable and Maintainable ETL - Databricks
0
rewiews
Techniques to create configurable, maintainable data pipelines. Learn to externalize configurations, validate inputs, and leverage Scala features for robust, easily deployable ETL processes.
Add to list
14
Lesons
22 minutes
On-Demand
Free-Video
Databricks
Fully Utilizing Spark for Data Validation with Fugue and Pandera
0
rewiews
Explore lightweight data validation for Spark using Fugue and Pandera, enabling partition-specific rules and efficient big data processing without compromising on functionality.
Add to list
12
Lesons
22 minutes
On-Demand
Free-Video
Databricks
Delight - Improved Apache Spark UI for Performance Troubleshooting
0
rewiews
Explore Delight, a free monitoring dashboard for Apache Spark. Learn to troubleshoot and optimize data engineering pipelines using system metrics and Spark information for improved performance and cost-effectiveness.
Add to list
5
Lesons
21 minutes
On-Demand
Free-Video
Databricks
Data Quality Tools Comparison for Continuous Data Imports
0
rewiews
Comparison of open-source data quality tools for continuous imports, covering maturity, documentation, extensibility, and features like data profiling and anomaly detection.
Add to list
17
Lesons
28 minutes
On-Demand
Free-Video
Databricks
Managing Millions of Tests Using Databricks - Automated Monitoring and Reporting System
0
rewiews
Explore automated test monitoring and reporting for Databricks Runtime using Delta, analyzing results from various sources to track quality and efficiently report failures to owners.
Add to list
14
Lesons
25 minutes
On-Demand
Free-Video
Databricks
Multi-Table Transactions with LakeFS and Delta Lake - Tech Talk
0
rewiews
Explore multi-table transactions using LakeFS and Delta Lake for collaborative data lake management, enabling CI/CD deployment and simplified multi-table pipelines in cloud storage systems.
Add to list
21
Lesons
45 minutes
On-Demand
Free-Video
Databricks
Composable Data Processing with Apache Spark - Scaling Development and Error Handling
0
rewiews
Learn about SIP, an extensible plugin framework for Apache Spark that enhances data processing efficiency, error handling, and scalability in Adobe's Experience Platform.
Add to list
5
Lesons
28 minutes
On-Demand
Free-Video
Databricks
Deep Learning Pipelines for High Energy Physics Using Apache Spark and Distributed Keras
0
rewiews
Explore CERN's Apache Spark-based data pipeline for deep learning in High Energy Physics. Learn about distributed training of neural network classifiers using BigDL and Analytics Zoo for improved event filtering at LHC experiments.
Add to list
30
Lesons
39 minutes
On-Demand
Free-Video
Databricks
Best Practices for Building and Deploying Data Pipelines in Apache Spark
0
rewiews
Learn best practices for building and deploying efficient, reproducible data pipelines in Apache Spark. Discover a toolkit that simplifies pipeline creation for non-engineers and explore deployment strategies.
Add to list
19
Lesons
41 minutes
On-Demand
Free-Video
Databricks
How to Automate Performance Tuning for Apache Spark
0
rewiews
Discover techniques to automate Apache Spark performance tuning, addressing common issues and exploring tools for efficient, scalable data pipeline maintenance in production environments.
Add to list
19
Lesons
41 minutes
On-Demand
Free-Video
Databricks
Simplify and Scale Data Engineering Pipelines with Delta Lake
0
rewiews
Learn to build scalable data engineering pipelines using Delta Lake's multi-hop architecture. Discover how to progress from raw data ingestion to machine learning-ready datasets efficiently and effectively.
Add to list
23
Lesons
38 minutes
On-Demand
Free-Video
Databricks
Best Practices for Building Robust Data Platforms with Apache Spark and Delta
0
rewiews
Learn best practices for building robust data platforms using Apache Spark and Delta. Gain insights on optimizing performance, scaling, security, and cost-effectiveness in big data architectures from real-world experiences.
Add to list
13
Lesons
27 minutes
On-Demand
Free-Video
Databricks
Generative Hyperloop Design - Managing Massively Scaled Simulations for Demand Modeling
0
rewiews
Explore large-scale simulation framework for Hyperloop design, utilizing cloud computing, data pipelines, and machine learning to address key technical and business questions for safe and efficient mass transportation.
Add to list
12
Lesons
26 minutes
On-Demand
Free-Video
Databricks
Real-Time Forecasting at Scale Using Delta Lake and Delta Caching
0
rewiews
Explore real-time forecasting at scale using Delta Lake and Delta Caching. Learn efficient data sampling, storage, and caching techniques for handling massive datasets and achieving rapid forecast response times.
Add to list
14
Lesons
25 minutes
On-Demand
Free-Video
Databricks
Building the Petcare Data Platform with Delta Lake and Spark ETL Pipeline
0
rewiews
Explore Mars Petcare's data platform using Delta Lake and Spark ETL pipeline 'Kyte'. Learn about advantages over Azure Data Factory and leveraging Delta Lake for ETL configurations and data science.
Add to list
7
Lesons
27 minutes
On-Demand
Free-Video
Open Data Science
Reliable Pipelines and High Quality Data Without the Toil - Kyle Kirwan
0
rewiews
Explore data observability's power in enhancing pipeline quality, reducing maintenance toil, and optimizing performance. Learn key components: metrics, data, lineage, and alerts for effective data management.
Add to list
7
Lesons
26 minutes
On-Demand
Free-Video
Linux Foundation
Fluentd - A Complete Logging Ecosystem for Kubernetes
0
rewiews
Explore Fluentd's internals, best practices, SDKs, and sub-projects for efficient logging in cloud-native environments, with a focus on Kubernetes integration and operational challenges.
Add to list
1
Lesons
34 minutes
On-Demand
Free-Video
Open Data Science
Orchestrating Data Assets Instead of Tasks, With Dagster - Sandy Ryza
0
rewiews
Explore data orchestration with Dagster, focusing on managing data assets over tasks. Learn about pipeline development, testing, deployment, and monitoring in data engineering and machine learning.
Add to list
12
Lesons
31 minutes
On-Demand
Free-Video
Linux Foundation
The Future of Data Pipelines with Atomic Wasm Transformations - Evolving Role of Data Engineers
0
rewiews
Explore atomic Wasm transformations in data pipelines and their impact on data engineering roles, focusing on innovative approaches to data processing and workflow optimization.
Add to list
1
Lesons
44 minutes
On-Demand
Free-Video
Linux Foundation
Building an Open Source Streaming Analytics Stack with Kafka and Druid
0
rewiews
Learn to build a robust streaming analytics stack using Kafka and Druid for real-time data processing, flexible querying, and maintaining data integrity in high-volume environments.
Add to list
24
Lesons
41 minutes
On-Demand
Free-Video
Linux Foundation
Building Robust Streaming Data Pipelines with Apache Spark
0
rewiews
Learn to build robust streaming data pipelines using Apache Spark, Kafka, and Camel. Explore ETL challenges, data management, and lessons from running these technologies in Docker.
Add to list
15
Lesons
42 minutes
On-Demand
Free-Video
Linux Foundation
Deploying Fast Data Pipelines on Mesos and DC/OS
0
rewiews
Overview of Fast Data systems, their deployment on Mesos and DC/OS, and their impact on Big Data processing. Learn about the evolving landscape of data pipelines and efficient operational strategies.
Add to list
1
Lesons
39 minutes
On-Demand
Free-Video
Data Science Dojo
What Is a Data Engineer?
0
rewiews
Explore data engineering: goals, skills, tools, and its role in the data ecosystem. Learn how data engineers support data scientists and drive business success through effective data management.
Add to list
28
Lesons
50 minutes
On-Demand
Free-Video
CNCF [Cloud Native Computing Foundation]
Building a Data Science Platform with Argo Workflows
0
rewiews
Discover how Bloomberg and Pipekit leverage Argo Workflows and Kubernetes to create scalable data science platforms, enabling data teams to automate pipelines and self-serve their data engineering needs across multiple clusters.
Add to list
1
Lesons
27 minutes
On-Demand
Free-Video
Open Data Science
Applied Reinforcement Learning for Online Ads and Recommender Systems
0
rewiews
Explore how Reinforcement Learning revolutionizes online advertising, from personalized content delivery to stochastic bandit algorithms, covering business impacts and technical challenges in real-world implementation.
Add to list
5
Lesons
43 minutes
On-Demand
Free-Video
CNCF [Cloud Native Computing Foundation]
Scheduling Jupyter Notebooks Using Airflow on Kubernetes
0
rewiews
Learn to schedule Jupyter notebooks using Airflow on Kubernetes, enabling scalable data pipelines and machine learning workflows with enhanced security and resource management.
Add to list
1
Lesons
28 minutes
On-Demand
Free-Video
Rust
Serverless Data Pipelines in Rust - Rust Vienna Meetup
0
rewiews
Explore building serverless data pipelines in Rust using DataFusion and object store, with deployment on AWS Lambda. Learn scalable techniques for complex pipelines in production environments.
Add to list
1
Lesons
41 minutes
On-Demand
Free-Video
CNCF [Cloud Native Computing Foundation]
The Data Pipelines Behind Forest Carbon Credits - Why Pachama Uses Flyte to Orchestrate Workflows
0
rewiews
Explore how Pachama uses Flyte to manage complex data and ML pipelines for transparent forest carbon credit computation, including workload management, configuration, and lessons learned.
Add to list
1
Lesons
34 minutes
On-Demand
Free-Video
Linux Foundation
Database Native Support for CDC Streaming Pipeline
0
rewiews
Explore a database-native solution for Cassandra's CDC logs, addressing common challenges and proposing an improved architecture based on industry insights and managed CDC solutions.
Add to list
1
Lesons
24 minutes
On-Demand
Free-Video
The ASF
New Features of Apache NiFi - Tips and Techniques
0
rewiews
Explore Apache NiFi's latest features, processors, and best practices. Build efficient data flows using cutting-edge techniques, with tips and guides for optimal implementation.
Add to list
1
Lesons
39 minutes
On-Demand
Free-Video
The ASF
Inference at Scale with Apache Beam
0
rewiews
Learn to deploy and scale machine learning models efficiently using Apache Beam for distributed inference on CPUs and GPUs, with practical insights on parallelizing workloads.
Add to list
1
Lesons
34 minutes
On-Demand
Free-Video
The ASF
Apache Arrow and Go: A Match Made in Data
0
rewiews
Explore Apache Arrow's Go implementation, covering Arrow and Parquet libraries, Flight server/client, C Data API integration, and performance optimization techniques.
Add to list
1
Lesons
42 minutes
On-Demand
Free-Video
All Things Open
Machine Learning with Apache Beam
0
rewiews
Explore Apache Beam for distributed data pipelines in machine learning, covering inference, data processing, and training with hands-on demos for parallelizing ML workloads.
Add to list
1
Lesons
36 minutes
On-Demand
Free-Video
NashKnolX
Ensuring Data Quality in Databricks with Great Expectations
0
rewiews
Integrate Great Expectations with Databricks to set, test, and enforce data pipeline expectations, ensuring accuracy and reliability in your data workflows.
Add to list
1
Lesons
47 minutes
On-Demand
Free-Video
NashKnolX
Delta Format and Live Tables - Pioneering Data Innovation
0
rewiews
Explore Delta format and Live tables, understanding their role in scalable big data pipelines and modern data engineering innovations.
Add to list
1
Lesons
36 minutes
On-Demand
Free-Video
NashKnolX
ETL Observability: Azure to Snowflake - Monitoring with Splunk and Power BI
0
rewiews
Explore ETL monitoring from Azure to Snowflake using ADF, Splunk, and Power BI. Learn data lineage and observability techniques for efficient pipeline management.
Add to list
1
Lesons
48 minutes
On-Demand
Free-Video
NashKnolX
Data Engineering with Databricks - Leveraging the Lakehouse Platform for ETL Pipelines
0
rewiews
Leverage Databricks Lakehouse Platform to build ETL pipelines, use Delta Live Tables, and orchestrate tasks with Workflows for efficient data processing.
Add to list
1
Lesons
44 minutes
On-Demand
Free-Video
NashKnolX
Introduction to Data Build Tool (DBT)
0
rewiews
Master data build tool (DBT) for efficient data product development, including CICD, pipelines, validation, quality testing, deployment, and monitoring.
Add to list
1
Lesons
59 minutes
On-Demand
Free-Video
Shaw Talebi
How to Build Data Pipelines for ML Projects with Python Code
0
rewiews
Learn to build data pipelines for machine learning projects using Python. Explore ETL vs ELT, extraction, transformation, loading, and orchestration. Includes practical example of processing YouTube video transcripts.
Add to list
10
Lesons
23 minutes
On-Demand
Free-Video
Shaw Talebi
Automating Data Pipelines with Python and GitHub Actions - Code Walkthrough
0
rewiews
Automate data pipelines using Python and GitHub Actions. Learn to create ETL scripts, set up repositories, configure workflows, and implement ML applications for efficient data processing.
Add to list
13
Lesons
31 minutes
On-Demand
Free-Video
Devoxx Poland
Accelerating Big Data - Modern Trends, Enable Product Analytics
0
rewiews
Explore modern trends in big data acceleration and enable effective product analytics for improved decision-making and business insights.
Add to list
1
Lesons
46 minutes
On-Demand
Free-Video
GeeCON Conference
Cloud Agnostic Data Platform with DBT, Trino, Iceberg and MinIO
0
rewiews
Explore a hands-on, cloud-agnostic data pipeline using TrinoDB, DBT, MinIO, and Apache Iceberg for scalable and lightweight analytics, suitable for both cloud and on-premise solutions.
Add to list
1
Lesons
34 minutes
On-Demand
Free-Video
EuroPython Conference
DBT and Python - How to Write Reusable and Testable Pipelines
0
rewiews
Develop reusable and testable data pipelines using DBT and Python. Learn best practices for version control, DAGs, and unit testing in SQL and Python-based models.
Add to list
1
Lesons
31 minutes
On-Demand
Free-Video
EuroPython Conference
From Pandas to Production: ELT with dlt
0
rewiews
Explore data load tool (dlt) to streamline data science workflows, overcome roadblocks, and transition smoothly from exploration to production. Learn best practices for data handling and managing failures with real-life examples.
Add to list
1
Lesons
27 minutes
On-Demand
Free-Video
Toronto Machine Learning Series (TMLS)
Simplifying Machine Learning Lifecycle Management in Healthcare
0
rewiews
Explore end-to-end ML lifecycle in healthcare using Azure services. Learn data ingestion, model creation, deployment, and visualization for real-time patient predictions and batch scoring.
Add to list
1
Lesons
2 hours 12 minutes
On-Demand
Free-Video
Toronto Machine Learning Series (TMLS)
MLOps Without Much Ops - Building Efficient Machine Learning Systems
0
rewiews
Discover modern, no-nonsense data pipelines for efficient machine learning systems. Learn PaaS advantages and explore real-world applications with open-source code. Gain insights on ML's future for organizations of all sizes.
Add to list
1
Lesons
1 hour 22 minutes
On-Demand
Free-Video
Data Science Conference
Retail Transactions at Scale for AI - From PoC to Production
0
rewiews
Discover how to scale AI solutions for retail transactions, from handling millions of unstructured data points to enabling enterprise-wide AI applications and reporting capabilities.
Add to list
1
Lesons
24 minutes
On-Demand
Free-Video
Databricks
Version Control for Lakehouse Architecture - Essential Practices and Benefits
0
rewiews
Discover how to implement data version control using lakeFS for improved data quality, reproducibility, and experimentation in Databricks data/ML pipelines, enhancing overall product value.
Add to list
1
Lesons
15 minutes
On-Demand
Free-Video
Databricks
Ocean Cleanup's Efforts to Remove Ocean Plastic - Leveraging Databricks
0
rewiews
Databricks supports Ocean Cleanup's mission to remove 90% of ocean plastic by 2040, enhancing computer vision models, developing impact reporting, and streamlining data architecture for more efficient operations.
Add to list
1
Lesons
31 minutes
On-Demand
Free-Video
Databricks
Building a Trusted Data Foundation for AI on Databricks
0
rewiews
Modernize data engineering for AI with Qlik's integration of Databricks AI Functions, automating real-time data pipelines to enhance AI-driven analysis and ML model development on the Databricks platform.
Add to list
1
Lesons
39 minutes
On-Demand
Free-Video
Databricks
Delta Live Tables in Depth - Best Practices for Intelligent Data Pipelines
0
rewiews
Explore best practices, new features, and future developments for Delta Live Tables with Databricks experts. Learn pipeline development, data quality, and serverless compute for intelligent data pipelines.
Add to list
1
Lesons
1 hour 8 minutes
On-Demand
Free-Video
Confluent
Data Pipelines Evolution from Batch to Streaming
0
rewiews
Explore the evolution from batch to streaming data pipelines using Apache Flink and Kafka. Learn strategies for seamless transition, including query-based connectors, change data capture, and handling late events.
Add to list
1
Lesons
41 minutes
On-Demand
Free-Video
Confluent
Streaming vs. Batch Processing: The Future of Data Architecture
0
rewiews
Explore the future of data processing with industry experts as they debate streaming vs. batch, adoption challenges, and best practices for real-time data architecture.
Add to list
8
Lesons
44 minutes
On-Demand
Free-Video
SQLBits
The Metadata-Driven Data Warehouse
0
rewiews
Streamline data warehouse development using metadata-driven approaches. Learn efficient techniques for Insert/Update/Delete operations and pipeline management.
Add to list
1
Lesons
45 minutes
On-Demand
Free-Video
SQLBits
Introduction to Databricks Delta Live Tables
0
rewiews
Accelerate data pipeline development with Databricks Delta Live Tables. Learn setup, ingestion, table creation, pipeline execution, and data validation for faster, more reliable projects.
Add to list
1
Lesons
50 minutes
On-Demand
Free-Video
load more...