TensorRT to improve Inference performance

Tags machine learning, technique Property Table of Content How TensorRT improve inference performance Tradeoff of TensorRT Conversion workflow How TensorRT improve model performance 1. Precision Calibration Convert weights and activation function from precision FP32 to FP16, INT8 to reduce the size of weights. This can cause the decrease in accuracy (sometimes significant) In real-time application, … Continue reading TensorRT to improve Inference performance →

The correct way to measure inference time of Deep Neural Networks

Tags machine learning, technique Property Mistakes when measure the inference time of Deep Neural Networks 1. Transferring data between host and devices (CPU and GPU) Most common mistake is measure time of transferring data from CPU to GPU. This transfer is done unintentionally when a tensor is created on CPU and then performed on GPU. … Continue reading The correct way to measure inference time of Deep Neural Networks →

Book I have read in 2022

March Cứ bay rồi sẽ cao - Nguyễn Phi Vân, Nguyễn Tuần Huỳnh The book gives advices from authors to young people to have a successful and happy life. Luôn biết ơn, đừng take it for grantedChyên nghiệp là hoàn thành công việc trên cả mong đợiLuôn đặt mình vào vị trí người làm chủ … Continue reading Book I have read in 2022 →

Object Tracking

Part 1: Single object tracking Part 2: Multiple Object Tracking Part 3: Optical Flow and Car Speed Estimation Part 4: Lane Detection