Introduction to True Intelligence
Exploring the principles of compositional understanding and adaptability in AI systems - how can we create AI that reasons and solves problems beyond its training data? This article examines the theoretical foundations and practical approaches to achieving true machine intelligence.

Introduction to Diffusion Models
Mathematical introduction to Diffusion Models in Generative Modeling

Introduction to Lean
It’s a short introduction to lean theorem prover. It first part of the series covering Simple Type Theory and simple proof of infinitude of primes to give a feel of how it works.

How to read HeadCT
This blog reflects on how to read head CT scans and what are different types of bleeds?

Brain Anatomy using CT Scans
What is a CT scan? Brain components visible in CT scan and their relevance.

Introduction to Pydicom
This blog is a tutorial on pydicom which is a pure Python package for working with Dicom files. It lets you read, modify and write Dicom data in an easy “pythonic” way.

DICOM Standards
What is Dicom? What standards does it follow? OHIF Viewer Walkthrough.

Docker Overview
Leveraging docker to encourage reproducibility and easy maintainace of project.

Heart Rate detection using camera
How euler video magnification can be used to magnify motion and color.

Markov Decision Process
A framework for decision making in situations where outcomes are partly random and partly under the control of a decision-maker.

Knowledge Distillation
Transforming large deep learning models to smaller models woith imporved perform.

Book Recommendations

Never Split the Difference: Negotiating As If Your Life Depended On It
Christopher Voss and Tahl Raz

Thinking, Fast and Slow
Daniel Kahneman

Predictably Irrational
Dan Ariely

Notes

BANTA & ZOPA
Some negotiation strategies.

Presentations

ASPECT Scoring
ASPECTS is a 10-point quantitative topographic CT scan score used for patients with middle cerebral artery (MCA) stroke. To compute ASPECTS, 1 point is subtracted from 10 for any evidence of early ischemic change for each of the defined regions. This CT score is simple and reliable and may be helpful in the early stages of care for supporting transfer or therapy decisions.

Deep Meta Handles
A novel approach to learning deformation handles for 3D meshes using biharmonic coordinates. Rather than generating meshes from scratch, which is challenging due to irregular data structures, this work focuses on learning to deform existing mesh geometries while preserving their connectivity structure. The method enables intuitive and efficient mesh manipulation through learned meta-handles.

VMNet
VMNet is a novel deep learning architecture that combines geodesic and euclidean convolutions over 3D meshes and their voxel representations. Rather than operating on a single representation, this dual-stream approach leverages both the precise geometric information from mesh-based geodesic convolutions and the regular grid structure of voxel-based euclidean convolutions. The network performs semantic segmentation on 3D scenes from the ScanNet dataset, effectively labeling different parts of indoor environments.

SimSiam
SimSiam is a simple framework for self-supervised learning that demonstrates how siamese networks are fundamental to contrastive learning methods. It achieves competitive performance using stop-gradient operations and small batch sizes, showing that complex techniques used in other methods like MoCo, BYOL, SimCLR and SwAV are mainly design choices rather than essential components.

BYOL
BYOL (Bootstrap Your Own Latent) is a novel self-supervised learning approach that achieves state-of-the-art performance without relying on negative pairs. Using two neural networks that learn from each other through different augmented views of images, it reaches 74.3% top-1 accuracy on ImageNet with ResNet-50 and demonstrates strong transfer learning capabilities.

Blog - Ujjwal Upadhyay