Forem

# computervision

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Challenges to adapt AI-based Video Codecs
Cover image for Challenges to adapt AI-based Video Codecs

Challenges to adapt AI-based Video Codecs

Comments
5 min read
Vision Transform

Vision Transform

Comments
16 min read
Tech in Roofing: Drones, CV, and LLMs that ship better inspections tags: ai, drones, computervision, construction, casestudy
Cover image for Tech in Roofing: Drones, CV, and LLMs that ship better inspections tags: ai, drones, computervision, construction, casestudy

Tech in Roofing: Drones, CV, and LLMs that ship better inspections tags: ai, drones, computervision, construction, casestudy

Comments
2 min read
Optimizing Multi-Zone Restaurant Service with Computer Vision for Hospitality
Cover image for Optimizing Multi-Zone Restaurant Service with Computer Vision for Hospitality

Optimizing Multi-Zone Restaurant Service with Computer Vision for Hospitality

Comments
9 min read
Smart Stable Monitoring System for Premium Remote Horse Care
Cover image for Smart Stable Monitoring System for Premium Remote Horse Care

Smart Stable Monitoring System for Premium Remote Horse Care

1
Comments
9 min read
From SageMaker to Static Site: Hosting a Deep Learning Model on the Frontend
Cover image for From SageMaker to Static Site: Hosting a Deep Learning Model on the Frontend

From SageMaker to Static Site: Hosting a Deep Learning Model on the Frontend

Comments
4 min read
How I Built an AI-Powered Face Recognition App from Scratch

How I Built an AI-Powered Face Recognition App from Scratch

Comments
1 min read
Building a Diffusion Model from Scratch: CIFAR-10 in 15 Minutes
Cover image for Building a Diffusion Model from Scratch: CIFAR-10 in 15 Minutes

Building a Diffusion Model from Scratch: CIFAR-10 in 15 Minutes

Comments
5 min read
[memo]SafeVLA: Towards Safety Alignment of VisionLanguage-Action Model via Constrained Learning

[memo]SafeVLA: Towards Safety Alignment of VisionLanguage-Action Model via Constrained Learning

Comments
1 min read
Frontiers in Computer Vision: Foundation Models, Multimodal Learning, Robustness, and Privacy from the July 2025 arXiv H

Frontiers in Computer Vision: Foundation Models, Multimodal Learning, Robustness, and Privacy from the July 2025 arXiv H

Comments
7 min read
Building a Motion Tracking Balloon Burst Game with Python & OpenCV
Cover image for Building a Motion Tracking Balloon Burst Game with Python & OpenCV

Building a Motion Tracking Balloon Burst Game with Python & OpenCV

Comments
3 min read
Two Face Recognition Projects Failed. $33K Burned — All Because of Bad Camera Setup
Cover image for Two Face Recognition Projects Failed. $33K Burned — All Because of Bad Camera Setup

Two Face Recognition Projects Failed. $33K Burned — All Because of Bad Camera Setup

Comments
3 min read
Does DINO loss compare the [CLS] tokens from both teacher and student?

Does DINO loss compare the [CLS] tokens from both teacher and student?

Comments
1 min read
Modular Snip Recorder: A Data Collection Tool for Behavior Cloning (1/2)

Modular Snip Recorder: A Data Collection Tool for Behavior Cloning (1/2)

Comments
5 min read
[memo] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

[memo] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Comments
1 min read
Building a Deep Learning Model to Detect Potato Diseases: My Journey with PlantVillage.
Cover image for Building a Deep Learning Model to Detect Potato Diseases: My Journey with PlantVillage.

Building a Deep Learning Model to Detect Potato Diseases: My Journey with PlantVillage.

2
Comments 1
3 min read
How Do NLP and Computer Vision Work Together in Modern AI Applications?
Cover image for How Do NLP and Computer Vision Work Together in Modern AI Applications?

How Do NLP and Computer Vision Work Together in Modern AI Applications?

Comments
4 min read
Inside the Research: A Detailed Technical Breakdown of SQD in Quantum Chemistry
Cover image for Inside the Research: A Detailed Technical Breakdown of SQD in Quantum Chemistry

Inside the Research: A Detailed Technical Breakdown of SQD in Quantum Chemistry

Comments
4 min read
VideoPrism: A Foundational Visual Encoder for Video Understanding

VideoPrism: A Foundational Visual Encoder for Video Understanding

Comments
1 min read
Comparing 3 ways to Train a Face Mask Classifier: Tensorflow, AWS Canvas, and Rekognition
Cover image for Comparing 3 ways to Train a Face Mask Classifier: Tensorflow, AWS Canvas, and Rekognition

Comparing 3 ways to Train a Face Mask Classifier: Tensorflow, AWS Canvas, and Rekognition

Comments
10 min read
How to Port CV/ML Models to Rockchip NPU for Faster Face Recognition
Cover image for How to Port CV/ML Models to Rockchip NPU for Faster Face Recognition

How to Port CV/ML Models to Rockchip NPU for Faster Face Recognition

Comments
3 min read
Recent Advances in Computer Vision: Generative Models, Multimodal Learning, Scene Understanding, and Robustness – An Aca

Recent Advances in Computer Vision: Generative Models, Multimodal Learning, Scene Understanding, and Robustness – An Aca

Comments
9 min read
Histogram equalization CLAHE algorithm.

Histogram equalization CLAHE algorithm.

Comments
1 min read
Frontiers in Computer Vision: Synthesizing Advances in Multimodal Perception, Representation Learning, and Efficiency fr

Frontiers in Computer Vision: Synthesizing Advances in Multimodal Perception, Representation Learning, and Efficiency fr

Comments
10 min read
🚗👁️ Segmentation d'Images pour le système embarqué d’une voiture autonome
Cover image for 🚗👁️ Segmentation d'Images pour le système embarqué d’une voiture autonome

🚗👁️ Segmentation d'Images pour le système embarqué d’une voiture autonome

Comments
21 min read
loading...