#Art & Design
#Adobe
#ChatGPT
#GitHub
#Data Engineering
YouTube
education
#Data Pipelines
#ETL
#Data Ingestion
#Medallion Architecture
#Flyte
#ETL Pipelines
Showing:
970
courses
NPTEL
Practical Machine Learning with Tensorflow
1
review
We will cover the basics of Tensorflow and Machine Learning in the initial sessions and advanced topics in the latter part. After this course, the students will be able to build ML models using Tensorflow.
34
Lessons
16 hours
On-Demand
Free-Video
MLOps.community
Implementing Data Capture for ML Observability and Drift Detection
0
reviews
Explore the implementation of data capture for machine learning observability and drift detection in this 24-minute conference talk by Pushkar Garg. Dive into the complexities of modern ML systems, including data pipelines and transformations across mult…
1
Lesson
24 minutes
On-Demand
Free-Video
MLOps.community
Going Beyond Two-Tier Data Architectures with DuckDB
0
reviews
Explore the innovative data architectures enabled by DuckDB, an in-process analytical data management system, in this 32-minute talk by Prof. Dr. Hannes Mühleisen. Discover how DuckDB's lightweight yet powerful design, available under the MIT license, al…
1
Lesson
32 minutes
On-Demand
Free-Video
MLOps.community
Building Data Infrastructure at Scale for AI/ML with Open Data Lakehouses
0
reviews
Explore how data lakehouse architecture with Apache Hudi supports real-world predictive ML and vector-based AI use cases in this 30-minute keynote by Vinoth Chandar, creator of Apache Hudi. Learn about ingesting data with minute-level freshness, providin…
1
Lesson
30 minutes
On-Demand
Free-Video
MLOps.community
Building Hyper-Personalized LLM Applications with Rich Contextual Data - DE4AI
0
reviews
Explore the concept of Full RAG (Retrieval-Augmented Generation) and its potential to revolutionize user experiences across industries in this 28-minute talk by Mike Del Balso, co-founder of Tecton. Examine four levels of context personalization, from ba…
1
Lesson
28 minutes
On-Demand
Free-Video
MLOps.community
The Daft Distributed Python Data Engine: Multimodal Data Curation at Any Scale
0
reviews
Explore the Daft distributed Python data engine for multimodal data curation at any scale in this 27-minute talk by Jay Chia. Discover how Daft addresses the fundamental needs of ML/AI data platforms, including terabyte-scale ETL with complex model batch…
1
Lesson
27 minutes
On-Demand
Free-Video
MLOps.community
Reproducible Data Science Over Data Lakes
0
reviews
Explore reproducible data science techniques for data lakes in this 12-minute conference talk by Ciro Greco, presented at DE4AI. Delve into the challenges of achieving reproducibility in Lakehouse architectures and discover recent advancements made at Ba…
1
Lesson
12 minutes
On-Demand
Free-Video
MLOps.community
Data Contracts and Observability: Complementary Approaches to Data Quality
0
reviews
Explore the critical roles of data contracts and observability in ensuring data quality and reliability in this 14-minute talk by Mark Freeman. Gain insights into how these two approaches complement each other, with data contracts preventing known issues…
1
Lesson
14 minutes
On-Demand
Free-Video
MLOps.community
Scaling Data Reliably: A Journey in Growing Through Data Pain Points
0
reviews
Explore the challenges and solutions in scaling data systems reliably in this 16-minute conference talk by Miriah Peterson at MLOps.community. Delve into the concept of Data Downtime and its impact on business outcomes. Learn how Data Reliability Enginee…
1
Lesson
16 minutes
On-Demand
Free-Video
MLOps.community
An Overview of Common ML Serving Architectures
0
reviews
Explore common machine learning serving architectures in this 18-minute conference talk by Rebecca Taylor, tech lead of Personalization at Lidl e-commerce. Gain insights into the disconnect between academic teachings and industry practices in model deplo…
1
Lesson
18 minutes
On-Demand
Free-Video
MLOps.community
Data Engineering for Streamlining the Data Science Developer Experience - DE4AI
0
reviews
Explore the challenges and solutions in data engineering for optimizing the data science developer experience in this 12-minute talk by Aishwarya Joshi from Chime. Discover how to enable efficient feature engineering and deployment for low-latency infere…
1
Lesson
12 minutes
On-Demand
Free-Video
MLOps.community
Partnering with Product for Effective Data Ingestion and Training Data
0
reviews
Discover strategies for effective collaboration between data science and product teams in this 19-minute talk by Daniela Santisteban from MLOps.community. Learn how to leverage product management partnerships to ensure quality data ingestion and training…
1
Lesson
19 minutes
On-Demand
Free-Video
MLOps.community
The Evolution of Lyft's ML Feature Store
0
reviews
Explore the evolution of Lyft's machine learning feature store in this 13-minute conference talk presented by Devon Mittow, a Staff Data Engineer with over a decade of experience in data engineering across various industries. Gain insights into how Lyft'…
1
Lesson
13 minutes
On-Demand
Free-Video
MLOps.community
AI-Powered Data Unification for Data Platforms
0
reviews
Explore the intersection of AI and data platforms in this 13-minute conference talk by Dr. Shelby Heinecke, an AI research team leader at Salesforce. Delve into the critical operation of data unification and discover how small, efficient Large Language M…
1
Lesson
13 minutes
On-Demand
Free-Video
MLOps.community
Putting the AI Back in Medallion Lake Design
0
reviews
Explore a comprehensive analysis of lakehouse design and its integration with AI in this insightful 13-minute talk by Simon Whiteley, a Databricks Beacon and Microsoft MVP. Delve into the challenges companies face when adopting lakehouses and learn how t…
1
Lesson
13 minutes
On-Demand
Free-Video
MLOps.community
DuckDB's Search Capabilities for AI and LLM Stacks
0
reviews
Explore DuckDB's underrated search capabilities and its potential within an LLM stack in this 13-minute talk by Mehdi Ouazza, an experienced data engineer and developer relations lead at MotherDuck. Discover how DuckDB's versatile and efficient search me…
1
Lesson
13 minutes
On-Demand
Free-Video
CodeWithYu
Realtime Streaming for Anomaly Detection - An End-to-End Data Engineering Project
0
reviews
Build a real-time anomaly detection system using streaming data engineering techniques. Learn end-to-end implementation for monitoring and identifying unusual patterns in data streams.
1
Lesson
1 hour 6 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
MLOps Platform Architecture for End-to-End ML Pipelines
0
reviews
Explore Avast's MLOps journey, tooling, and cultural shift for enhanced ML productivity. Learn strategies for model tracking, storage, orchestration, and E2E deployments in large-scale ML pipelines.
1
Lesson
42 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Building Scalable ML Services for Rapid Development in Health and Wellness
0
reviews
Discover how to build and scale ML services for health and wellness products, reducing release time by 85% using open-source tools like dbt, Airflow, and MLflow.
1
Lesson
27 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
ML Platform Beyond Kaggle Paradigm - Policy-Centric Approach for End-to-End Automation
0
reviews
Explore a policy-centric approach to ML platforms, extending beyond model-centric views to enhance engineering productivity and directly improve business metrics.
1
Lesson
24 minutes
On-Demand
Free-Video
Dynatrace
Anomaly Detection on 5 Pillars of Data Observability with Dynatrace Davis AI
0
reviews
Explore Dynatrace Davis AI's anomaly detection for data observability pillars: freshness, distribution, volume, schema, and lineage. Learn automation techniques for alerting and monitoring data integrity.
8
Lessons
34 minutes
On-Demand
Free-Video
GOTO Conferences
Building a CML Pipeline with Spark and Kafka - YOW! 2018
0
reviews
Explore building a CML pipeline using Apache Spark and Kafka. Learn functional programming techniques for efficient data processing and machine learning workflows.
1
Lesson
47 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Quality Assurance in Machine Learning: Enhancing Trust and Reliability
0
reviews
Explore quality assurance in machine learning, its importance, and strategies for enhancing AI trustworthiness through lessons from other disciplines and best practices in MLOps and data science.
1
Lesson
28 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
From Idea to Production: AI Infrastructure for Scaling LLM Applications
0
reviews
Explore strategies for scaling LLM applications from beta to production, addressing challenges and building adaptable AI infrastructure for evolving models and workflows.
1
Lesson
38 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Apache Airflow: Where Data Engineers and ML Engineers Meet
0
reviews
Explore production-quality Generative AI pipelines using Apache Airflow. Learn to orchestrate workflows, integrate data engineering, and implement MLOps best practices for robust AI applications.
1
Lesson
47 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Learn Your Codebase: Fine-tuning CodeLlama with Flyte to Learn Flyte
0
reviews
Discover how to fine-tune LLMs on specific codebases using Flyte, an open-source orchestration platform. Explore multi-node, multi-gpu distributed training for efficient model adaptation with limited resources.
1
Lesson
1 hour 27 minutes
On-Demand
Free-Video
Dynatrace
Dynatrace Business Events: Expanding Observability into Your Business Domain
0
reviews
Expand observability into your business domain with Dynatrace Business Events. Learn to capture, ingest, explore, and analyze critical business data using OneAgent, API, and DQL for enhanced business analytics.
9
Lessons
33 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Parallelizing Your ETL with Dask on Kubeflow
0
reviews
Leverage Dask's parallelism capabilities on Kubeflow for advanced ETL processing. Learn to use the Dask Operator to optimize resource utilization in interactive Jupyter sessions and pipeline workflows.
1
Lesson
1 hour 48 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Building Real Time ML Pipelines with a Feature Store
0
reviews
Explore real-time ML pipeline construction using feature stores. Learn to optimize compute efficiency and enhance ML workflows for production environments.
1
Lesson
31 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
How to Treat Your Data Platform Like a Product - 5 Key Best Practices
0
reviews
Discover 5 key best practices for treating your data platform as a product, ensuring reliability and scalability for maximum business value.
1
Lesson
30 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Implementing SecMLOps at Every Stage of the ML Pipeline
0
reviews
Implement SecMLOps at every stage of the ML pipeline to address security challenges, comply with data privacy laws, and ensure robust ML system architecture.
17
Lessons
42 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Lessons Learned from DAG-Based Workflow Orchestration
0
reviews
Explore DAG-based workflow orchestration challenges and learn about Prefect Orion's DAG-less system for enhanced runtime flexibility and developer experience in data pipelines.
1
Lesson
39 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
UnionML: A Microframework for Building Machine Learning Applications
0
reviews
Explore UnionML, a microframework simplifying ML application development. Learn to streamline the process from research to production, addressing common challenges in the ML lifecycle.
1
Lesson
27 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Building Real-Time ML Features with a Feature Platform
0
reviews
Explore challenges in deploying ML pipelines and learn how feature stores and platforms solve data problems for production ML, enabling real-time processing and scaling.
1
Lesson
34 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
The Critical Things You Have to Build to Transform Your Company to be ML Driven
0
reviews
Discover key components for transforming your company into an ML-driven organization. Learn strategies for overcoming challenges in the machine learning development lifecycle and deploying models effectively.
1
Lesson
39 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Becoming an ML Platform Power Builder with ML Observability
0
reviews
Discover best practices for building effective ML platforms. Learn key components, justification strategies, and how ML infrastructure differs from software infrastructure.
1
Lesson
46 minutes
On-Demand
Free-Video
MLOps World: Machine Learning in Production
Airflow: Where Data Engineers and ML Engineers Meet
0
reviews
Explore Apache Airflow's role in operational machine learning with a real-world demo and reference implementation of GenAI in production.
1
Lesson
10 minutes
On-Demand
Free-Video
The Machine Learning Engineer
GraphRAG - Unleashing the Power of Knowledge Graphs with LLMs
0
reviews
Explore GraphRAG: AI-powered content interpretation using LLMs to create knowledge graphs from unstructured text, enabling advanced search and question-answering capabilities.
1
Lesson
1 hour 1 minute
On-Demand
Free-Video
Anyscale
ByteDance's Platform for Reinforcement Learning from Human Feedback
0
reviews
Explore ByteDance's journey in building a video data processing pipeline using Ray's ecosystem for creating realistic video generation models from text instructions.
1
Lesson
32 minutes
On-Demand
Free-Video
Apache Airflow Tutorials
0
reviews
Learn Apache Airflow fundamentals, set up environments, write pipelines, and integrate with Google Cloud services. Gain practical skills for efficient workflow automation and data pipeline management.
7
Lessons
2 hours 30 minutes
On-Demand
Free-Video
Apache Airflow 101
0
reviews
Learn to programmatically author, schedule, and monitor workflows using Apache Airflow. Covers installation, DAGs, operators, and hands-on examples with MySQL and email integration.
14
Lessons
2 hours 30 minutes
On-Demand
Free-Video
Apache Airflow Tutorials
0
reviews
Learn to install and use Apache Airflow with Spark, Hive, and Sqoop operators on Windows, with tutorials in English and Tamil.
9
Lessons
2 hours 30 minutes
On-Demand
Free-Video
Cloudera
Cloudera How To Videos
0
reviews
Comprehensive video series covering Cloudera's data platform, including security, governance, analytics, deployment, and management topics for enterprise-level data solutions.
33
Lessons
2 hours 30 minutes
On-Demand
Free-Video
Great Learning
Big Data Analytics for Beginners
0
reviews
Comprehensive introduction to big data analytics, covering key concepts like Hadoop, Apache Spark, and ETL. Ideal for beginners seeking to understand the landscape and value chain of big data.
8
Lessons
1 hour 11 minutes
On-Demand
Free-Video
Google Cloud Platform Beginner Series
0
reviews
Comprehensive introduction to Google Cloud Platform, covering key technologies, hands-on exercises, and practical applications for beginners in cloud computing.
18
Lessons
6 hours
On-Demand
Free-Video
Confluent
Building Data Pipelines with Apache Kafka and Confluent
0
reviews
Learn to build streaming data pipelines using Apache Kafka® and Confluent. Covers data ingestion, real-time transformation with ksqlDB, and data egress. Hands-on tutorials included for practical experience.
13
Lessons
1 hour 30 minutes
On-Demand
Free-Video
Confluent
Data Mesh and Data Domains Tutorials - Data Mesh 101
0
reviews
Comprehensive tutorial on data mesh architecture, covering its principles, implementation, and practical applications in modern data analytics, with hands-on examples using Confluent's platform.
10
Lessons
1 hour 30 minutes
On-Demand
Free-Video
Confluent
KSQLDB Videos by Confluent
0
reviews
Comprehensive video series on ksqlDB, covering demos, integrations, use cases, installation, core concepts, data manipulation, and production deployment for stream processing with Apache Kafka.
21
Lessons
3 hours 30 minutes
On-Demand
Free-Video
Amigoscode
Kafka Tutorial - Spring Boot Microservices
0
reviews
Learn to integrate Apache Kafka with Spring Boot microservices, covering topics, producers, consumers, and building a RESTful API that interacts with the Kafka ecosystem.
14
Lessons
51 minutes
On-Demand
Free-Video
Microsoft
Citus Cluster Zero-Downtime Migration - Algolia's Experience
0
reviews
Algolia engineers share their zero-downtime migration strategy from Citus Cloud to Azure, detailing planning, execution, and key takeaways for seamless analytics infrastructure transition.
9
Lessons
28 minutes
On-Demand
Free-Video
Microsoft
Azure Data Factory Power Hour - Latest Updates and Demos
0
reviews
Exclusive Azure Data Factory updates and demos, including Power Query integration, managed virtual networks, Teams notifications, and user-assigned managed identities for enhanced data integration capabilities.
8
Lessons
1 hour 1 minute
On-Demand
Free-Video
Microsoft
High-Level Introduction to MLOps - AI Show Episode 33
0
reviews
Microsoft data scientists discuss MLOps challenges, principles, and best practices, offering insights on successful implementation and tips for overcoming common obstacles in AI projects.
13
Lessons
34 minutes
On-Demand
Free-Video
Microsoft
Smart Data Pipelines to Azure - Ingesting and Migrating Data the DataOps Way
0
reviews
Explore StreamSets' DataOps platform for optimizing data ingestion and migration to Azure. Learn about smart data pipelines, integration with Azure services, and how to speed up cloud adoption using modern data engineering practices.
12
Lessons
30 minutes
On-Demand
Free-Video
The AI University
Data Engineering Full Hands-on Course
0
reviews
Comprehensive hands-on introduction to data engineering, covering data lake architecture, Cloudera tools, Apache Sqoop, and practical HDFS and MySQL operations.
7
Lessons
1 hour 30 minutes
On-Demand
Free-Video
Data Science Dojo
Putting MLOps into Practice
0
reviews
Streamline your machine learning lifecycle with MLOps best practices, from project initiation to model deployment and monitoring. Learn practical implementation strategies and reference architectures.
10
Lessons
47 minutes
On-Demand
Free-Video
SAP
Strategy and Latest Updates for Data Integration at SAP
0
reviews
Explore SAP's data integration strategy, focusing on SAP Data Intelligence's role in orchestrating complex data landscapes and powering the Business Technology Platform for effective hybrid data management.
20
Lessons
45 minutes
On-Demand
Free-Video
Anyscale
Building an Agentic Framework for Generative AI Applications
0
reviews
Discover how to build an agentic framework for GenAI applications, focusing on RAG-centric approaches, multilingual queries, and real-time data processing for enhanced customer assistance.
1
Lesson
32 minutes
On-Demand
Free-Video
The ASF
Building a Lakehouse with Apache Hudi - Data Architecture at Kuaishou
0
reviews
Discover how Kuaishou Inc implements Apache Hudi to build a robust lakehouse architecture, exploring real-world applications and technical insights for large-scale data management.
1
Lesson
48 minutes
On-Demand
Free-Video
CNCF [Cloud Native Computing Foundation]
Running Remote Shuffle Service to Solve Apache Spark's Dynamic Resource Allocation Challenge on Kubernetes
0
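reviews
Explore using a remote shuffle service to solve Apache Spark's dynamic resource allocation challenge on Kubernetes, decoupling storage from compute and improving the reliability and scalability of big data processing.
1
Lesson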
54 minutes
On-Demand
Free-Video
BasisTech
Data Engineering and Data Science Platform Based on Hadoop/Spark
0
reviews
Building and operating a large-scale data platform centered on Hadoop/Spark. Introduces practical approaches to data engineering and data science using SQL-on-Hadoop, Spark, and Python.
1
Lesson
35 minutes
On-Demand
Free-Video
Ubuntu OnAir
Crea tu Propio Laboratorio de Big Data con MicroK8s y Spark
0
reviews
Learn to create a Big Data lab using Spark and MicroK8s. Discover how to emulate a data architecture and practice essential skills for analyzing large volumes of information.
1
Lesson
57 minutes
On-Demand
Free-Video
PyCon US
SQL está en todas partes: Explorando alternativas con Ibis
0
reviews
Discover Ibis: a Python library for efficient data queries across multiple backends. It simplifies working with SQL databases and offers an alternative to complex traditional queries.
1
Lesson
27 minutes
On-Demand
Free-Video
PyCon US
Ingeniería de Datos para la Salud Mental con Calabaza_bot - Charla de Sergio Sanchez
0
reviews
Discover how to create a Telegram bot for mental health tracking using open source technologies. Learn to integrate tools such as AWS, OpenAI Whisper, and dbt for personal well-being analysis.
1
Lesson
31 minutes
On-Demand
Free-Video
The ASF
Cloud-Native Solutions and Practices for Clickstream Data Analysis
0
reviews
Build an autonomous, cost-effective clickstream data analysis system using cloud-native practices and Apache Kafka to optimize products and drive business growth.
1
Lesson
36 minutes
On-Demand
Free-Video
BSidesLV
Security Data Science Teams: A Guide to Prestige Classes
0
reviews
Explore data-driven security roles, their overlapping skills, and career paths in this talk on the evolving landscape of security data science teams and job titles.
1
Lesson
51 minutes
On-Demand
Free-Video
Data Council
How Data Teams Can Contribute to Data Privacy
0
reviews
Explore global data privacy regulations, dispel misconceptions, and learn advanced techniques for privacy-aware systems in data engineering and management.
1
Lesson
26 minutes
On-Demand
Free-Video
Security BSides San Francisco
Reinventing ETL for Detection and Response Teams
0
reviews
Explore innovative strategies for efficient, cost-effective data collection and log enrichment tailored to Detection and Response teams' unique requirements in cybersecurity.
1
Lesson
29 minutes
On-Demand
Free-Video
Databricks
AI-Powered EDR: Streamlining Blackberry Cybersecurity with Databricks
0
reviews
Discover how Databricks optimized Blackberry's EDR system, improving incident detection speed, reducing query latency by 20%, and cutting costs by 30% while enhancing data management and cybersecurity capabilities.
1
Lesson
31 minutes
On-Demand
Free-Video
Databricks
Ensuring GDPR Compliance for Real-Time Data Pipelines
0
reviews
Strategies for GDPR compliance in real-time data pipelines using Databricks tools. Covers immediate solutions with Delta Lake's CDF and long-term plans for PII separation, offering practical privacy compliance approaches.
1
Lesson
31 minutes
On-Demand
Free-Video
EuroPython Conference
Impersonation in Data Engineering: No More Credentials in Your Code
0
reviews
Discover how to streamline data engineering workflows using IAM, Workload Identity, and impersonation. Eliminate credential hassles, improve security, and boost productivity in your development process.
1
Lesson
24 minutes
On-Demand
Free-Video
SQLBits
Empowering Women in Data - Navigating Challenges, Seizing Opportunities
0
reviews
Explore strategies for women to thrive in data, gain insights on inclusivity initiatives, and learn from success stories in the field.
1
Lesson
20 minutes
On-Demand
Free-Video
Data Science Festival
The Challenges of Engineering Data Science
0
reviews
Explore the unique challenges and perspectives of engineering in data science through insights from industry experts.
1
Lesson
17 minutes
On-Demand
Free-Video
CMU Database Group
Accelerating Data and AI with Spice.ai Open-Source Software
0
reviews
Discover how Spice.ai's open-source software accelerates data and AI development, exploring innovative approaches to streamline machine learning workflows and enhance database integration.
1
Lesson
1 hour 6 minutes
On-Demand
Free-Video
The ASF
The Evolution of Data Infrastructure in the Age of LLM
0
reviews
Explore how Large Language Models are reshaping modern data infrastructure, examining key architectural changes and emerging best practices for scalable AI systems.
1
Lesson
29 minutes
On-Demand
Free-Video
Anyscale
Supercharging ETL and Analytics with Ray and Daft
0
reviews
Discover how the Daft library enhances Ray clusters with distributed ETL capabilities, enabling seamless data processing, analytics, and ML/AI integration for scalable, high-performance workflows.
1
Lesson
11 minutes
On-Demand
Free-Video
Data Con LA
BODi's Data Modernization Journey - Implementing Cloud Data Warehouses with Snowflake
0
reviews
Discover how BODi modernized their data infrastructure by implementing Snowflake cloud warehouses, integrating siloed systems, and managing a complex 18-month migration of tables, pipelines, and applications.
1
Lesson
45 minutes
On-Demand
Free-Video
OSACon
ETL with Meltano and Singer in the LLM Era
0
reviews
Explore how traditional ETL tools like Singer and Meltano can enhance data pipeline management for LLM applications, addressing production-level challenges beyond basic LangChain capabilities.
1
Lesson
30 minutes
On-Demand
Free-Video
OSACon
Real-Time Revolution: Kickstarting Your Journey in Streaming Data
0
reviews
Dive into stream processing fundamentals and learn to build real-time data applications using Python's Bytewax framework, debunking common misconceptions along the way.
1
Lesson
24 minutes
On-Demand
Free-Video
OSACon
Unlocking Scalable and Efficient Data Storage with Apache Ozone
0
reviews
Discover how Apache Ozone revolutionizes distributed object storage, offering scalable solutions for managing massive data volumes while ensuring reliability, cost-effectiveness, and seamless integration with the Hadoop ecosystem.
1
Lesson
27 minutes
On-Demand
Free-Video
OSACon
Data Alchemy: Transforming Raw Data to Gold with Apache Hudi and DBT
0
reviews
Discover how to build an efficient medallion architecture using Apache Hudi's CDC feature and dbt for incremental data processing, enabling low-latency analytics and streamlined data transformation workflows.
1
Lesson
29 minutes
On-Demand
Free-Video
OSACon
Navigating the Landscape of a Fully Open Source Data Stack in 2023
0
reviews
Explore the architecture of modern data stacks, from warehousing to streaming solutions, while understanding the potential and limitations of open-source tools in today's data infrastructure.
1
Lesson
27 minutes
On-Demand
Free-Video
OSACon
Building a ChatGPT Data Pipeline with RisingWave Stream Processor and Astra Vector Search
0
reviews
Discover how to build real-time GenAI pipelines by combining RisingWave's stream processing with Astra's vector embedding and similarity search capabilities for ChatGPT applications.
1
Lesson
27 minutes
On-Demand
Free-Video
OSACon
Where the Modern Data Stack Has Failed and Why Engineering-centric Tools Will Reshape the Data World
0
reviews
Explore how engineering-focused tools like DuckDB, SDF, and Dagster are reshaping data engineering, addressing limitations of the modern data stack, and enabling more scalable, programmable solutions.
1
Lesson
21 minutes
On-Demand
Free-Video
OSACon
Unveiling the Power of dbt and DuckDB - Hype vs Reality
0
reviews
Explore the practical applications and limitations of combining dbt and DuckDB for data transformations, examining real-world use cases and performance considerations.
1
Lesson
20 minutes
On-Demand
Free-Video
OSACon
The Real Modern Data Stack - Building ETL Pipelines with Open Source Tools
0
reviews
Discover how to build powerful, cost-effective ETL data stacks using open-source tools like Sling, dbt, DuckDB, and Dagster, turning months of implementation into rapid deployment.
1
Lesson
21 minutes
On-Demand
Free-Video
Open Data Science
Data Planning to Implementation
0
reviews
Explore the journey from data planning to implementation, covering data strategy, maturity, roadmaps, and best practices for turning insights into actionable solutions across businesses.
11
Lessons
33 minutes
On-Demand
Free-Video
Open Data Science
Streaming Featurization with Ibis, Substrait and Apache Arrow
0
reviews
Explore real-time streaming data processing using Ibis, Substrait, and Apache Arrow. Learn how this software stack enhances featurization workflows for fast, accurate insights and decision-making in data science.
10
Lessons
31 minutes
On-Demand
Free-Video
Open Data Science
Getting Into Data Engineering
0
reviews
Explore data engineering career paths, industry changes, and essential skills with Joe Reis. Gain insights on transitioning from software engineering or data analysis roles and receive valuable career advice.
13
Lessons
33 minutes
On-Demand
Free-Video
Open Data Science
ODSC West 2015 - Cloud Native Data Science
0
reviews
Cloud-native design principles for data science applications: creating scalable, stateless models easily deployable in modern cloud systems. Insights on open-source platforms and best practices.
13
Lessons
27 minutes
On-Demand
Free-Video
Open Data Science
How Companies Are Using Tachyon, a Memory Centric Distributed Storage
0
reviews
Explore Tachyon, a memory-centric distributed storage system for fast big data processing. Learn about its benefits, use cases, and integration with popular frameworks like Spark and Hadoop.
13
Lessons
22 minutes
On-Demand
Free-Video
Open Data Science
Applying Engineering Best Practices in Data Lakes Architectures - Einat Orr
0
reviews
Explore engineering best practices for data lakes architectures, covering data infrastructure evolution, building blocks of data products, and practical tooling examples for effective data engineering.
9
Lessons
27 minutes
On-Demand
Free-Video
Prodramp
Mastering Fake Business Data Creation in Python With Faker
0
reviews
Learn to generate realistic fake business data using Python's Faker library. Covers standard and community providers, creating diverse datasets, and integrating with Pandas for analysis.
19
Lessons
31 minutes
On-Demand
Free-Video
Prodramp
How You Can Build Your Own Worldwide Wildfire Data Collection and Host on Kaggle
0
reviews
Learn to collect, process, and visualize global wildfire data using NASA sources, data engineering techniques, and tools like Kepler.gl, mapboxgl, and Streamlit for insightful analysis and presentation.
19
Lessons
1 hour 11 minutes
On-Demand
Free-Video
Prodramp
Modern and Postmodern Data Stack - Architecture and Why You Need It
0
reviews
Explore modern and postmodern data stack architectures, their features, and importance in data engineering. Learn why organizations need to evolve their data infrastructure for improved analytics and decision-making.
8
Lessons
18 minutes
On-Demand
Free-Video
Prodramp
Data Engineering and Storage Concepts - With ETL, ELT, Warehouse, Lake, and Lake House
0
reviews
Comprehensive overview of data engineering concepts, including storage solutions, ETL/ELT processes, and modern data architectures like data mesh, with practical examples and best practices for effective data management.
19
Lessons
27 minutes
On-Demand
Free-Video
Databricks
Funnel Analysis with Apache Spark and Druid for Advertising Campaign Effectiveness
0
reviews
Learn to perform funnel analysis at scale using Apache Spark, Druid, and DataSketches. Discover techniques for measuring campaign effectiveness and analyzing user behavior in chronological order for large-scale advertising campaigns.
24
Lessons
26 minutes
On-Demand
Free-Video
Databricks
Introduction, Principles and Origin of Dagster
0
reviews
Introduction to Dagster: A data orchestrator for the entire application lifecycle. Covers principles, development, deployment, and monitoring. Includes demo and code snippets showcasing Dagit UI and Dagster programming model.
11
Lessons
30 minutes
On-Demand
Free-Video
Databricks
Automated Metadata Management in Data Lakes - A CI/CD Driven Approach
0
reviews
Discover a CI/CD-driven approach to automated metadata management in data lakes, focusing on balancing development speed, governance, and schema evolution for rapidly growing organizations.
10
Lessons
28 minutes
On-Demand
Free-Video
Databricks
Empowering Developers with Self-Service ETL - Zillow's Approach
0
reviews
Zillow's self-service ETL solutions empower teams to build, maintain, and monitor data pipelines, abstracting complex processes and catering to diverse user needs while leveraging internal services for efficient implementation.
17
Lessons
24 minutes
On-Demand
Free-Video
Databricks
Code Once, Use Often - Declarative Data Pipelines
0
reviews
Learn to build efficient, reusable data pipelines using declarative approaches. Explore techniques for reducing food waste through data-driven solutions and automation in this developer-focused session.
21
Lessons
28 minutes
On-Demand
Free-Video