Skip to content
This repository has been archived by the owner on Oct 26, 2024. It is now read-only.

Jerome's Checklist #10

Open
7 of 24 tasks
Ishaan-Datta opened this issue Apr 15, 2024 · 0 comments
Open
7 of 24 tasks

Jerome's Checklist #10

Ishaan-Datta opened this issue Apr 15, 2024 · 0 comments
Assignees

Comments

@Ishaan-Datta
Copy link
Contributor

Ishaan-Datta commented Apr 15, 2024

  • Converting .onnx file to .trt in colab
  • Implementing the TensorRT engine inference using NVIDIA examples and other tutorials in C++
  • Optimizing the inference with batching, CUDA contexts, multithreading/asynchronous streams
  • Include best practices: NVIDIA TensorRT optimizations (include creating interfaces or abstract classes that define the common functionality required for neural network processing)
  • Comparison w/ industry examples (Nvidia, Zed, Github examples) to adapt our case to the Jetson hardware
  • Metric logging for benchmarking using Nvidia tutorials
  • Test different batch sizes to maximize the GPU utlilization
  • Investigate any other potential areas for multiprocessing or parallel execution to speed up task processing

Links Checklist:

Links Checklist (some of these may overlap w/ Jerome's so maybe meet with him to discuss progress and what needs to be revisited or reviewed)

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants