Forem

# computervision

Posts

Illuminating the Dark: Next-Gen Object Detection from Raw Sensor Data by Arvind Sundararajan
2 min read
Let’s unlock Synthetic Presence with SadTalker in Google Colab And Bring Images to Life
5 min read
Illuminating the Unseen: AI-Powered Clarity in Low-Light Imaging
2 min read
Object-Aware Navigation: Giving Robots a Human Understanding of Space
2 min read
How to set up a Raspberry Pi camera with Shinobi for reliable, 24/7 CCTV monitoring
5 min read
See to Do: Teaching Robots to Handle the Real World by Arvind Sundararajan
2 min read
From Pixel to Perfection: Instant 3D Models from Single Images by Arvind Sundararajan
2 min read
Unlock the Secrets of Unlabeled Videos: A Deep Dive into Zero-Effort AI Training
2 min read
Forget Labels: AI Learns Continuously From Raw Video (and It's a Game Changer)
2 min read
Seeing in the Dark: Unveiling Hidden Details with Adaptive Image Processing
2 min read
Vision Transform
16 min read
Challenges to adapt AI-based Video Codecs
5 min read
Tech in Roofing: Drones, CV, and LLMs that ship better inspections tags: ai, drones, computervision, construction, casestudy
2 min read
Optimizing Multi-Zone Restaurant Service with Computer Vision for Hospitality
9 min read
From SageMaker to Static Site: Hosting a Deep Learning Model on the Frontend
4 min read
How I Built an AI-Powered Face Recognition App from Scratch
1 min read
Building a Diffusion Model from Scratch: CIFAR-10 in 15 Minutes
5 min read
Smart Stable Monitoring System for Premium Remote Horse Care
9 min read
[memo] SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning
1 min read
Frontiers in Computer Vision: Foundation Models, Multimodal Learning, Robustness, and Privacy from the July 2025 arXiv H
7 min read
Building a Motion Tracking Balloon Burst Game with Python & OpenCV
3 min read
Two Face Recognition Projects Failed. $33K Burned — All Because of Bad Camera Setup
3 min read
Does DINO loss compare the [CLS] tokens from both teacher and student?
1 min read
Modular Snip Recorder: A Data Collection Tool for Behavior Cloning (1/2)
5 min read
[memo] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
1 min read