Forem

# computervision

Posts

From SageMaker to Static Site: Hosting a Deep Learning Model on the Frontend
4 min read
How I Built an AI-Powered Face Recognition App from Scratch
1 min read
Building a Diffusion Model from Scratch: CIFAR-10 in 15 Minutes
5 min read
Smart Stable Monitoring System for Premium Remote Horse Care
9 min read
[memo] SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning
1 min read
Frontiers in Computer Vision: Foundation Models, Multimodal Learning, Robustness, and Privacy from the July 2025 arXiv H
7 min read
Building a Motion Tracking Balloon Burst Game with Python & OpenCV
3 min read
Two Face Recognition Projects Failed. $33K Burned — All Because of Bad Camera Setup
3 min read
Does DINO loss compare the [CLS] tokens from both teacher and student?
1 min read
Modular Snip Recorder: A Data Collection Tool for Behavior Cloning (1/2)
5 min read
[memo] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
1 min read
Building a Deep Learning Model to Detect Potato Diseases: My Journey with PlantVillage.
3 min read
How Do NLP and Computer Vision Work Together in Modern AI Applications?
4 min read
Inside the Research: A Detailed Technical Breakdown of SQD in Quantum Chemistry
4 min read
VideoPrism: A Foundational Visual Encoder for Video Understanding
1 min read
Comparing 3 ways to Train a Face Mask Classifier: Tensorflow, AWS Canvas, and Rekognition
10 min read
How to Port CV/ML Models to Rockchip NPU for Faster Face Recognition
3 min read
Recent Advances in Computer Vision: Generative Models, Multimodal Learning, Scene Understanding, and Robustness – An Aca
9 min read
Histogram equalization CLAHE algorithm.
1 min read
Frontiers in Computer Vision: Synthesizing Advances in Multimodal Perception, Representation Learning, and Efficiency fr
10 min read
🚗👁️ Image Segmentation for the Embedded System of an Autonomous Car
21 min read
Seeing the World: A Beginner's Guide to Convolutional Neural Networks (CNNs) with PyTorch
8 min read
Advancements in Computer Vision and Pattern Recognition: A Synthesis of Emerging Themes and Innovations from May 2025 ar
7 min read
Advancements in Computer Vision: Innovations and Challenges in Continual Learning, Generative Modeling, and Anomaly Dete
7 min read
Recent Advances in Computer Vision: Efficient Adaptation, 3D Understanding, Robustness, Multi-Modal Fusion, Medical Appl
12 min read