Jeff Liu Lab
HomeProjectsWorkshopAI WikiAI LabShop
Sign In
All
Computing Science
Artificial Intelligence
Deep Learning
Reinforcement Learning
AI Agents
Embodied Intelligence
Robot Engineering
Human-Like Intelligence
AI Engineering
← Back to Wiki
Robot Engineering
Computer Engineering Overview
CPU Architecture
GPU & Parallel Computing
Memory & Storage
Bus & Interfaces
Embedded Systems
Operating System Fundamentals

Comments (0)

Sign in to comment

Table of Contents
OverviewGPU Architecture FundamentalsCPU vs. GPU: Design PhilosophyNVIDIA GPU ArchitectureSIMT Execution ModelCUDA Programming FundamentalsThread HierarchyMemory HierarchyCUDA Kernel ExampleImage Processing ExampleParallel Speedup TheoryTheoretical SpeedupGustafson's LawTensorRT Inference AccelerationTensorRT Optimization PipelineKey Optimization TechniquesInference Performance ExamplesJetson GPU Specifications ComparisonGPU Programming Best Practices1. Maximize Occupancy2. Coalesced Memory Access3. Minimize CPU-GPU Data Transfers4. Leverage Tensor CoresRobot Vision Processing PipelineSummaryReferences

© 2026 Jeff Liu Lab. All rights reserved.

AboutPricingPrivacy & TermsContact