#Model Optimization

Explore the future of large language models, discussing scaling challenges, sparse mixtures of experts, improved prompting techniques, and new forms of supervision for enhanced NLP performance.

Add to list

50 Lesons

56 minutes

On-Demand

Free-Video

EfficientML.ai Lecture - Introduction to Efficient Machine Learning

Dive into the world of efficient machine learning with this comprehensive lecture from MIT HAN Lab. Explore cutting-edge techniques and strategies for optimizing ML models, reducing computational costs, and improving overall performance. Learn from exper…

Add to list

1 Lesons

1 hour 31 minutes

On-Demand

Free-Video

Leaner, Greener and Faster PyTorch Inference with Quantization

Explore PyTorch quantization techniques to optimize neural networks, reducing size and improving speed without sacrificing accuracy. Learn implementation strategies and real-world applications.

Add to list

1 Lesons

1 hour 38 minutes

On-Demand

Free-Video

Efficiently Fine-Tune and Serve Your Own LLMs

Learn to efficiently fine-tune and deploy large language models, optimizing performance and resource utilization for production environments.

Add to list

1 Lesons

42 minutes

On-Demand

Free-Video

Beyond the Model Zoo - Optimizing Foundation Models for Your Application

Optimize foundation models for specific applications, exploring techniques beyond pre-trained models to enhance performance and efficiency in machine learning projects.

Add to list

1 Lesons

31 minutes

On-Demand

Free-Video

Using Reproducible Experiments to Create Better Machine Learning Models

Enhance ML model efficiency through reproducible experiments. Learn to track hyperparameter, code, and data changes using DVC for grid and random search tuning methods.

Add to list

1 Lesons

30 minutes

On-Demand

Free-Video

A Gentle Introduction to Sparsity with a Concrete Example

Explore sparsification techniques for ML models, including popular methods and a concrete example to understand their benefits.

Add to list

1 Lesons

26 minutes

On-Demand

Free-Video

Accelerating Transformers with Hugging Face Optimum and Infinity

Explore techniques to accelerate Transformer models using Hugging Face's Optimum library and Infinity solution for millisecond-scale latencies in production environments.

Add to list

1 Lesons

1 hour 28 minutes

On-Demand

Free-Video

Enhance Cost Efficiency in Domain Adaptation with PruneMe

Optimize LLMs through layer pruning, enabling cost-effective domain adaptation and model merging for improved performance in specific applications.

Add to list

1 Lesons

17 minutes

On-Demand

Free-Video

TinyEngine: Transformer and Large Language Models - Lecture 12

Dive into Transformer and LLM architectures, exploring their applications and optimizations for efficient machine learning on resource-constrained devices.

Add to list

1 Lesons

1 hour 17 minutes

On-Demand

Free-Video

Learn TensorFlow and Deep Learning Fundamentals with Python - Code-First Introduction Part 2/2

Dive deep into TensorFlow and Python, mastering non-linear models, multi-class classification, and advanced techniques for optimizing neural networks and evaluating their performance.

Add to list

24 Lesons

3 hours 58 minutes

On-Demand

Free-Video

Data Science Dojo

Text Analytics Crash Course with R

Learn text analytics fundamentals, data processing, machine learning models, and visualization techniques using R. Master handling big data and optimizing models for valuable insights from unstructured text.

Add to list

12 Lesons

6 hours 9 minutes

On-Demand

Free-Video

The Machine Learning Engineer

LLMOps: Como usar Nvidia TensorRT SDK para Inferencia en GPU

Optimiza la inferencia en GPU con Nvidia TensorRT SDK, convirtiendo modelos y comparando tiempos de ejecución en diferentes precisiones de datos.

Add to list

1 Lesons

40 minutes

On-Demand

Free-Video

Advanced Anomaly Detection Made Easy

Advanced anomaly detection techniques for embedded ML, focusing on custom DSP blocks, feature importance, and threshold optimization. Learn to implement these methods for various IoT applications using Edge Impulse.

Add to list

36 Lesons

1 hour

On-Demand

Free-Video

Alan Turing Institute

Reinforcement and Mean-Field Games in Algorithmic Trading - Sebastian Jaimungal

Explore reinforcement learning and mean-field games in algorithmic trading, covering deep Q-learning, reinforced deep Kalman filters, and heterogeneous agent interactions with differing beliefs in financial markets.

Add to list

16 Lesons

1 hour 14 minutes

On-Demand

Free-Video

AutoML Toolkit Deep Dive - Automating Feature Engineering and Model Optimization

Explore Databricks Labs' AutoML Toolkit for automating feature engineering, model selection, tuning, and deployment. Learn to accelerate machine learning workflows and enhance productivity.

Add to list

24 Lesons

1 hour

On-Demand

Free-Video

Augmenting Machine Learning with Databricks Labs AutoML Toolkit

Explore how Databricks Labs AutoML Toolkit simplifies and optimizes machine learning processes, from data preparation to model optimization, using financial loan risk data examples.

Add to list

16 Lesons

30 minutes

On-Demand

Free-Video

Simulation Lab

3D Printing with 3ds Max + TyFlow - Fastest Way to Clean Your Models

Learn efficient techniques to optimize 3D models for printing using TyFlow's VDB tools in 3ds Max. Discover how to create clean, water-tight meshes and avoid common printing issues.

Add to list

7 Lesons

26 minutes

On-Demand

Free-Video

Reallusion

3DXchange 6 Tutorial - Optimizing Sketchup Models for Architectural Rendering

Learn to optimize Sketchup models for rendering in iClone and Indigo. Covers techniques like excluding back faces, smoothing normals, creating sub-props, merging identical meshes, and setting pivot points for efficient customization.

Add to list

6 Lesons

26 minutes

On-Demand

Free-Video

Freedom Arts - 3D Animation & Game Developer

CC3 to Smile Game Builder - Maya Bone Reduction Method

Step-by-step tutorial on reducing bones in Maya for exporting CC3 characters to Smile Game Builder, with real-time demonstrations and helpful links for game development tools.

Add to list

1 Lesons

40 minutes

On-Demand

Free-Video

Pixologic ZBrush

3D Printing: Sculpting, Detailing, and Preparation for Resin Printing - Week 3

Learn advanced 3D printing techniques for resin models, including layered sculpting, detail optimization, and print preparation. Gain insights on client approaches and industry trends.

Add to list

51 Lesons

1 hour 24 minutes

On-Demand

Free-Video

Pixologic ZBrush

ZBrush 2024 - Cosmo Armor 3D Printing Preparation Tutorial

Explore ZBrush 2024 techniques for 3D printing cosplay armor with Ian Robinson. Learn about model preparation, detailing, and optimization for successful costume creation.

Add to list

45 Lesons

1 hour 53 minutes

On-Demand

Free-Video

Pixologic ZBrush

3D Printing Post-Production: Exporting and Slicing - Week 4

Explore advanced 3D printing techniques, from ZBrush export to Lychee Slicer optimization. Learn model preparation, support structures, hollowing, and troubleshooting for successful prints.

Add to list

58 Lesons

1 hour 43 minutes

On-Demand

Free-Video

Data Science Conference

Lightweight Deep Learning on Edge Devices - Energy Efficient Approaches

Discover efficient deep learning techniques for edge devices, focusing on energy-saving approximation methods that maintain accuracy while improving performance on smartphones and IoT devices.

Add to list

13 Lesons

32 minutes

On-Demand

Free-Video

CNCF [Cloud Native Computing Foundation]

Efficient Edge Computing: Unleashing the Potential of AI/ML with Lightweight Kubernetes

Explore deploying AI/ML models in edge scenarios, comparing traditional and lightweight Kubernetes distributions. Learn strategies for efficient edge computing, including power consumption, model size, and performance optimization.

Add to list

1 Lesons

21 minutes

On-Demand

Free-Video

Bringing AI to the Heterogeneous Edge with WebAssembly and ONNX

Explore deploying AI workloads to edge devices using WebAssembly and ONNX, addressing challenges of heterogeneous environments, security, and resource constraints in a multiplatform development approach.

Add to list

1 Lesons

33 minutes

On-Demand

Free-Video

Freedom Arts - 3D Animation & Game Developer

Importing Geopipe New York City 3D Models into Unreal Engine 5.1 - Game Ready with Collision

Step-by-step guide to import New York City 3D models from Geopipe into Unreal Engine 5.1, creating a game-ready scene with perfect collision for immersive game development projects.

Add to list

1 Lesons

22 minutes

On-Demand

Free-Video

Freedom Arts - 3D Animation & Game Developer

Importing 100 Mixamo Characters into Smile Game Builder

Learn to import 100 free Mixamo characters into Smile Game Builder using 3DXchange and Simplygon. Step-by-step tutorial covers workflow and software integration for game development.

Add to list

1 Lesons

56 minutes

On-Demand

Free-Video

LLM Deployment Techniques - Lecture 13

Dive into advanced LLM deployment strategies, covering optimization techniques, infrastructure scaling, and practical implementation methods for efficient large language model deployment.

Add to list

1 Lesons

1 hour 16 minutes

On-Demand

Free-Video

EfficientML.AI - Introduction to Efficient Machine Learning

Master efficient machine learning techniques and optimization strategies for deploying ML models in resource-constrained environments, focusing on practical implementation methods.

Add to list

1 Lesons

1 hour 36 minutes

On-Demand

Free-Video

LLM Deployment Techniques - Lecture 13

Dive into advanced LLM deployment strategies, covering optimization techniques, infrastructure scaling, and practical implementation methods for efficient large language model deployment in production environments.

Add to list

1 Lesons

1 hour 17 minutes

On-Demand

Free-Video

AI Bites

Quantization in Deep Learning: Types, Algorithms, and Implementation

Explore quantization techniques in deep learning, from uniform to non-uniform approaches, and learn practical implementation strategies for optimizing large-scale neural networks.

Add to list

5 Lesons

13 minutes

On-Demand

Free-Video

AI Bites

QLoRA: Efficient Training of Large Language Models Using Quantization and Low-Rank Adaptation

Discover how QLoRA enables efficient training of large language models on a single GPU through innovative techniques like NormalFloat, Double Quantization, and Paged Optimizers.

Add to list

10 Lesons

12 minutes

On-Demand

Free-Video

CriticGPT: Understanding RLHF and Force Sampling Beam Search Optimization

Dive into OpenAI's innovative CriticGPT algorithm, exploring how RLHF and Force Sampling Beam Search optimize language models and enhance their reliability.

Add to list

1 Lesons

26 minutes

On-Demand

Free-Video

LoftQ: Understanding LoRA-Fine-Tuning-aware Quantization for LLMs

Dive into LoftQ, a groundbreaking LLM quantization method that combines with LoRA to achieve superior performance and efficiency in large language models.

Add to list

8 Lesons

14 minutes

On-Demand

Free-Video

ChatGPT vs Flan-T5: Comparing Proprietary and Free LLMs with Performance Tuning

Explore the differences between ChatGPT and Flan-T5 LLM models, comparing their capabilities, accessibility, and performance optimization through practical demonstrations and hyperparameter tuning techniques.

Add to list

1 Lesons

12 minutes

On-Demand

Free-Video

Running Flan-T5-XL Model on Google Colab - A Free Self-Explaining Language Model

Discover how to implement and optimize the Flan-T5-XL language model on Google Colab, exploring its capabilities in reasoning, essay writing, and translation tasks while working within free resource constraints.

Add to list

1 Lesons

18 minutes

On-Demand

Free-Video

UofU Data Science

Faster and Cheaper LLMs with Weight and Key-value Cache Quantization

Explore advanced techniques for optimizing Large Language Models through weight and key-value cache quantization methods to improve speed and reduce computational costs.

Add to list

1 Lesons

55 minutes

On-Demand

Free-Video

CNCF [Cloud Native Computing Foundation]

Accelerating High-Performance Machine Learning at Scale in Kubernetes

Hands-on guide for deploying optimized machine learning models in cloud native ecosystems, focusing on GPT-2 NLP model deployment in Kubernetes using ONNX Runtime and Seldon Core Triton server.

Add to list

1 Lesons

36 minutes

On-Demand

Free-Video

Aleksa Gordić - The AI Epiphany

EfficientNetV2 - Smaller Models and Faster Training - Paper Explained

Explore EfficientNetV2's improved image classification performance, including progressive training, Fused-MBConv layer, and novel reward function for Neural Architecture Search.

Add to list

7 Lesons

28 minutes

On-Demand

Free-Video

How to Make Your CPU as Fast as a GPU - Advances in Sparsity with Nir Shavit

Explore advances in sparsity and how CPUs can match GPU performance in neural networks. Learn about pruning, efficient algorithms, and the future of sparse architectures in deep learning.

Add to list

21 Lesons

50 minutes

On-Demand

Free-Video

Noether Networks - Meta-Learning Useful Conserved Quantities

Explore Noether Networks: a novel approach to meta-learning conserved quantities in sequential prediction problems, inspired by Noether's theorem and aimed at discovering useful symmetries and inductive biases.

Add to list

10 Lesons

1 hour 9 minutes

On-Demand

Free-Video

Bringing Choice, Automation and Performance to ML Deployment with Apache TVM and the OctoML Platform

Explore Apache TVM for ML deployment across diverse hardware, offering performance optimization and portability. Learn about OctoML's Octomizer for continuous model optimization and benchmarking.

Add to list

10 Lesons

30 minutes

On-Demand

Free-Video

Graham Neubig

Debugging Neural Nets for NLP

Comprehensive guide to diagnosing and resolving issues in neural networks for NLP, covering training, decoding, overfitting, and optimization techniques.

Add to list

17 Lesons

1 hour 14 minutes

On-Demand

Free-Video

Graham Neubig

Neural Nets for NLP - Debugging Neural Nets

Learn techniques for identifying and resolving issues in neural networks for NLP, covering training and test-time problems, optimization strategies, and performance analysis.

Add to list

25 Lesons

1 hour 15 minutes

On-Demand

Free-Video

Nerdy Rodent

LORA for Stable Diffusion - Dreambooth Extension - 6GB VRAM

Learn to train custom AI models using LORA Dreambooth for Stable Diffusion, even with limited 6GB VRAM. Discover faster, smaller, and improved techniques for personalized image generation.

Add to list

1 Lesons

22 minutes

On-Demand

Free-Video

1littlecoder

XGBoost and Data Leakage in Machine Learning - Day 14 of 30 Days of ML

Explore XGBoost and data leakage in machine learning. Learn to build optimized models, prevent leakage issues, and enhance your ML skills with practical tutorials and exercises.

Add to list

1 Lesons

37 minutes

On-Demand

Free-Video

Improving the Life of Data Scientists - Automating ML Lifecycle through MLflow

Explore Flock, an end-to-end platform leveraging MLflow to automate and simplify enterprise-grade machine learning, enhancing data scientist productivity and addressing regulatory challenges in ML adoption.

Add to list

17 Lesons

36 minutes

On-Demand

Free-Video

Data Science Dojo

Model Optimization - Series Conclusion - Introduction to Text Analytics with R

Optimize text analytics models, explore sensitivity/specificity tradeoffs, and discover resources for further study in R. Learn feature engineering and algorithm selection for improved effectiveness.

Add to list

1 Lesons

27 minutes

On-Demand

Free-Video

Movement Pruning - Adaptive Sparsity by Fine-Tuning

Explore Movement Pruning, an adaptive sparsity technique for fine-tuning deep neural networks. Learn its advantages over Magnitude Pruning in transfer learning scenarios and its impact on model efficiency.

Add to list

8 Lesons

30 minutes

On-Demand

Free-Video

Deconstructing Lottery Tickets - Zeros, Signs, and the Supermask

Explores the Lottery Ticket Hypothesis, analyzing key components of sparse networks and uncovering insights on weight initialization, sign importance, and the concept of Supermasks in neural network training.

Add to list

1 Lesons

36 minutes

On-Demand

Free-Video

Turing-NLG, DeepSpeed and the ZeRO Optimizer

Explore Microsoft's 17-billion parameter language model, ZeRO optimizer, and DeepSpeed, enabling efficient model and data parallelism for state-of-the-art natural language processing breakthroughs.

Add to list

1 Lesons

21 minutes

On-Demand

Free-Video

RoBERTa - A Robustly Optimized BERT Pretraining Approach

Explore how RoBERTa improves BERT's performance through optimized training, challenging recent model enhancements and highlighting the impact of hyperparameter choices in language model pretraining.

Add to list

1 Lesons

19 minutes

On-Demand

Free-Video

Derek Banas

Convolutional Neural Networks with TensorFlow 2022

Comprehensive live-coding tutorial on Convolutional Neural Networks using TensorFlow, covering data preparation, model architecture, optimization techniques, and practical applications in image recognition and object detection.

Add to list

1 Lesons

1 hour 42 minutes

On-Demand

Free-Video

Valerio Velardo - The Sound of AI

Audio Processing in Keras with Kapre

Learn to process audio in Keras using Kapre, exploring its advantages, features, and implementation. Discover how to embed audio transformations within deep learning models for optimized GPU-based calculations and easier deployment.

Add to list

10 Lesons

28 minutes

On-Demand

Free-Video

sentdex

Open AI's Whisper Is Amazing

Explore OpenAI's Whisper, a powerful speech recognition model capable of transcribing and translating 97 languages, with insights on its implementation, training, and performance.

Add to list

13 Lesons

26 minutes

On-Demand

Free-Video

Leaner and Greener AI with Quantization in PyTorch - Suraj Subramanian

Discover how quantization in PyTorch can make AI models lighter, faster, and more power-efficient without compromising accuracy. Learn techniques and workflows from an ML expert at Meta AI.

Add to list

8 Lesons

28 minutes

On-Demand

Free-Video

Tuning Machine Learning Models - Scaling, Workflows, and Architecture

Automating and scaling ML model tuning using Hyperopt and Apache Spark. Best practices for workflows, architecture, and optimization in hyperparameter tuning for improved performance and accuracy.

Add to list

10 Lesons

24 minutes

On-Demand

Free-Video

Scaling Up AI Research to Production with PyTorch and MLFlow

Explore PyTorch's latest advancements in AI research and production, including distributed training, model optimization, and deployment using MLFlow, with insights on scaling and efficiency.

Add to list

30 Lesons

44 minutes

On-Demand

Free-Video

1 kB and Not a Bit More - The Ideal Weight for a TinyML Model

Explore automated creation of ultra-compact ML models for tiny smart devices, enabling efficient embedding in memory-constrained hardware without data science expertise.

Add to list

18 Lesons

35 minutes

On-Demand

Free-Video

Nvidia

NVIDIA Tools for Training and Deploying Intelligent Vision Applications at the Edge

Discover tools for training, building, and deploying intelligent vision applications at the edge using NVIDIA's suite for video analytics pipelines and IoT devices.

Add to list

26 Lesons

1 hour 1 minute

On-Demand

Free-Video

Nvidia

Inference and Quantization for AI - Session 3

Explore quantized inference, TensorRT 5, and TensorFlow integration for AI model optimization. Learn about NVIDIA's inference server and techniques to improve model efficiency and performance.

Add to list

33 Lesons

41 minutes

On-Demand

Free-Video

On-Device Speech Models Optimization and Deployment for Mobile Hardware

Explore on-device speech model optimization and deployment, covering streaming-aware design, quantization techniques, and benchmarks for popular speech processing model topologies on mobile platforms.

Add to list

15 Lesons

26 minutes

On-Demand

Free-Video

Tiny Models with Big Appetites: Cultivating the Perfect Data Diet for Computer Vision

Explore techniques for curating optimal training datasets and designing sampling strategies for tiny computer vision models, emphasizing data quality and model robustness in real-world environments.

Add to list

14 Lesons

21 minutes

On-Demand

Free-Video

Delta Keyword Transformer: Bringing Transformers to the Edge Through Dynamically Pruned Multi-Head Self-Attention

Explore cutting-edge techniques for optimizing Transformers in edge computing, focusing on the Delta Keyword Transformer and its dynamic pruning approach for efficient multi-head self-attention.

Add to list

13 Lesons

21 minutes

On-Demand

Free-Video

Embedded Machine Learning in the Real World

Explore embedded ML applications, challenges, and opportunities in real-world scenarios, focusing on tiny models, accelerated hardware, and practical tooling.

Add to list

11 Lesons

27 minutes

On-Demand

Free-Video

Tiny but Powerful: Hardware for High Performance, Low Power Machine Learning

Explore cutting-edge hardware for ultra-low power machine learning at the edge, covering TinyML applications, model optimization, and emerging trends in efficient computing.

Add to list

16 Lesons

1 hour

On-Demand

Free-Video

Train One Network and Specialize It for Efficient Deployment

Explore efficient neural network deployment across diverse devices using the Once-for-All network approach, which enables quick specialization without retraining and outperforms state-of-the-art methods on edge devices.

Add to list

19 Lesons

36 minutes

On-Demand

Free-Video

Avoiding Loss of Quality in Tiny Models - Neuton.ai Partner Session

Explore compact AI models without sacrificing quality. Learn to assess model quality, understand decision-making logic, evaluate training data, and interpret model outputs for tinyML applications.

Add to list

23 Lesons

1 hour 2 minutes

On-Demand

Free-Video

Hardware Aware Training for Efficient Keyword Spotting - tinyML Research Symposium 2021

Explore hardware-aware training for efficient keyword spotting, focusing on Legendre Memory Unit networks to achieve state-of-the-art accuracy and power efficiency on various hardware platforms.

Add to list

10 Lesons

23 minutes

On-Demand

Free-Video

How Amazon Search Leverages PyTorch to Build and Deploy LLMs in Production

Explore Amazon Search's innovative use of PyTorch and its ecosystem for building and deploying large language models in production environments.

Add to list

1 Lesons

31 minutes

On-Demand

Free-Video

ML On-Device: Building Efficient Models

Explore on-device machine learning for efficient, privacy-preserving models on mobile and edge devices. Learn about real-time AI applications, model optimization, and deployment strategies.

Add to list

5 Lesons

34 minutes

On-Demand

Free-Video

deeplizard

CNN Training Loop Explained - Neural Network Code Project

Learn to build a convolutional neural network training loop using Python and PyTorch. Gain practical skills in implementing deep learning algorithms for image processing tasks.

Add to list

1 Lesons

22 minutes

On-Demand

Free-Video

Hyperparameter Tuning Using Kubeflow

Explore automated machine learning with Katib, a Kubernetes-native platform for hyperparameter tuning and neural architecture search, enhancing model performance efficiently.

Add to list

21 Lesons

35 minutes

On-Demand

Free-Video

AI Model Efficiency Toolkit

Explore cutting-edge techniques for optimizing AI models with Qualcomm's toolkit, enhancing performance and efficiency in machine learning applications.

Add to list

1 Lesons

53 minutes

On-Demand

Free-Video

ORPO: Monolithic Preference Optimization without Reference Model

Innovative approach to language model preference alignment without separate fine-tuning, achieving state-of-the-art performance using odds ratio optimization across various model sizes.

Add to list

1 Lesons

33 minutes

On-Demand

Free-Video

Prompt Engineering with Watsonx - Workshop

Aprenda técnicas de engenharia de prompts para otimizar modelos de linguagem, equilibrando inteligência e segurança. Exercícios práticos para dominar habilidades essenciais na interação com LLMs.

Add to list

1 Lesons

37 minutes

On-Demand

Free-Video

Data Science Dojo

Leveraging Open-Source LLMs for Production

Explore open-source LLMs, comparing them to proprietary options. Learn about hosting costs, performance evaluation, and practical applications. Gain insights into fine-tuning techniques and emerging trends in AI development.

Add to list

11 Lesons

1 hour 7 minutes

On-Demand

Free-Video

PyTorch 2.1 - New Features and Accelerating Generative AI Models

Explore PyTorch 2.1's new features for AI model acceleration, including compile, distributed, inference, and edge technologies, with insights on optimizing generative AI models for enhanced performance.

Add to list

6 Lesons

26 minutes

On-Demand

Free-Video

media.ccc.de

Shrinking Deep Learning Models: Techniques and Energy Considerations

Explore techniques to shrink deep learning models, reduce energy consumption, and run advanced architectures on commodity hardware. Gain insights into model efficiency and accessibility.

Add to list

1 Lesons

41 minutes

On-Demand

Free-Video

CNCF [Cloud Native Computing Foundation]

Strategies for Efficient LLM Deployments in Any Cluster

Explore strategies to optimize LLM deployments in Kubernetes clusters, focusing on reducing model size, improving resource utilization, and balancing performance with efficiency for cloud-to-edge scenarios.

Add to list

1 Lesons

31 minutes

On-Demand

Free-Video

Yacine Mahdid

Stochastic Depth for Neural Networks - Implementation and Analysis

Explore stochastic depth in neural networks: a regularization method for residual networks that enhances training speed and test performance. Includes methodology explanation and PyTorch implementation.

Add to list

14 Lesons

27 minutes

On-Demand

Free-Video

The Machine Learning Engineer

LLM Efficient Inference in CPUs and Intel GPUs - Intel Neural Speed

Optimize LLM inference on Intel CPUs and GPUs using Neural Speed. Explore efficient techniques for enhanced performance in machine learning and data science applications.

Add to list

1 Lesons

30 minutes

On-Demand

Free-Video

The Machine Learning Engineer

LLMOps: Using Nvidia TensorRT SDK for GPU Inference

Optimize GPU inference using Nvidia TensorRT SDK. Convert models, compare throughput, and explore batch sizes, data precision, and runtime options for enhanced performance.

Add to list

1 Lesons

34 minutes

On-Demand

Free-Video

EfficientML.ai Lecture - Introduction to Efficient Machine Learning

Explore cutting-edge techniques for optimizing machine learning models, focusing on efficiency and performance in real-world applications.

Add to list

1 Lesons

1 hour 34 minutes

On-Demand

Free-Video

IEEE Signal Processing Society

Model Based Deep Learning - Applications to Imaging and Communications

Explore model-based deep learning applications in imaging and communications with expert Yonina Eldar from the Weizman Institute of Science in this ICASSP 2023 keynote presentation.

Add to list

1 Lesons

1 hour 2 minutes

On-Demand

Free-Video

Black Hat

Garbage In, Garbage Out - How Purportedly Great Machine Learning Models Can Be Screwed Up by Bad Data

Explores how data quality impacts machine learning models for malicious URL detection, comparing results across different datasets and analyzing feature activations in neural networks.

Add to list

1 Lesons

28 minutes

On-Demand

Free-Video

USENIX

Accelerating Distributed MoE Training and Inference with Lina

Explore Lina, an innovative system for accelerating distributed Mixture of Experts training and inference, addressing all-to-all communication bottlenecks in large-scale AI models.

Add to list

1 Lesons

20 minutes

On-Demand

Free-Video

EuroPython Conference

How to Apply Deep Learning for 3D Object Recognition

Practical guide to achieving 80% accuracy in 3D object recognition using deep learning, covering data preparation, model optimization, and implementation strategies.

Add to list

9 Lesons

23 minutes

On-Demand

Free-Video

MLCon | Machine Learning Conference

MLOps - Automated Machine Learning Made Easy

Explore AI and ML optimization techniques to enhance business outcomes through data understanding and model improvement.

Add to list

1 Lesons

45 minutes

On-Demand

Free-Video

Scala Days Conferences

Doubt Truth to Be a Liar - Non Triviality of Type Safety for Machine Learning

Explore type-safe feature engineering in Scala for machine learning, covering Shapeless, macros, and quasiquotes to enhance ML framework design and model accuracy.

Add to list

19 Lesons

52 minutes

On-Demand

Free-Video

NDC Conferences

Deep Learning with PyTorch

Comprehensive introduction to deep learning using PyTorch, covering fundamentals, computer vision applications, and practical model creation for AI enthusiasts and developers.

Add to list

1 Lesons

1 hour 5 minutes

On-Demand

Free-Video

MLOps.community

Streamlining Model Deployment - AI in Production

Explore strategies for optimizing AI model deployment pipelines, including open governance, runtime benchmarks, and dynamic routing. Learn to make AI deployments faster, cheaper, and more accurate in a rapidly evolving landscape.

Add to list

1 Lesons

22 minutes

On-Demand

Free-Video

SK AI SUMMIT 2024

AI Camera 개성 부여를 위한 SKT Edge AI의 차별화된 Customized AI 기술

Discover SKT의 혁신적인 Edge AI 기술로 AI 카메라의 맞춤형 학습 및 현장 최적화 방식을 통해 제한된 환경에서도 고성능 비전 AI 구현 방법을 탐구합니다.

Add to list

1 Lesons

14 minutes

On-Demand

Free-Video

SK AI SUMMIT 2024

Squash - 딥러닝 서비스 추론 속도 개선기

Discover how Squash technology optimizes deep learning inference speed and reduces hardware costs through innovative model compression techniques for efficient AI service deployment.

Add to list

1 Lesons

16 minutes

On-Demand

Free-Video

SK AI SUMMIT 2024

이미지 센서에서의 딥러닝 모델 최적화와 구현 - SK 하이닉스의 On-Sensor AI 여정

Discover how SK Hynix engineers integrate deep learning models into image sensors, exploring AI semiconductor technology for enhanced mobile device functionality and image quality optimization.

Add to list

1 Lesons

22 minutes

On-Demand

Free-Video

Data Science Conference

Delivering Delivery Time Prediction - Machine Learning Lessons and Model Optimization

Discover practical insights into ML model development, from handling unclear requirements to ensemble classification techniques and determining optimal probability cutoffs for delivery predictions.

Add to list

1 Lesons

26 minutes

On-Demand

Free-Video

Toronto Machine Learning Series (TMLS)

Machine Learning on the Edge - From Microcontrollers to Embedded Linux Devices

Explore ML deployment on edge devices, from microcontrollers to embedded Linux systems. Learn about hardware options, model adaptation, and deployment strategies for real-world applications.

Add to list

1 Lesons

11 minutes

On-Demand

Free-Video

Toronto Machine Learning Series (TMLS)

Efficient Inference of Extremely Large Transformer Models