PaperSummary19 : CornerNet

2 min readJan 19, 2025

The paper introduces CornerNet, a novel (2018, then) object detection framework that identifies objects by detecting the top-left and bottom-right corners of their bounding boxes using a single convolutional neural network (ConvNet). Unlike traditional approaches, it eliminates the need for anchor boxes (a common but computationally expensive component in object detection) by formulating the task as a keypoint detection problem. CornerNet also proposes a new pooling method, corner pooling, to better localize the corners by incorporating contextual information.

The methodology is as follows:

Keypoint Detection: Two heatmaps are predicted, one for top-left corners and another for bottom-right corners, each corresponding to specific object categories. An associative embedding technique groups corresponding corners of the same object by minimizing the embedding distance between them.
Corner Pooling: A pooling mechanism designed to capture corner-specific features. It looks horizontally and vertically to locate the boundaries defining a corner.
Network Architecture: It uses an hourglass network as the backbone, modified to output heatmaps, embeddings and offsets for precise localization. A loss function combining focal loss (for heatmaps), offset regression loss and embedding losses (pull and push losses) is used to train the network.
Post-Processing: Non-maximum suppression (NMS) is applied to refine corner detections followed by embedding-based grouping and bounding box generation.

Image src: https://arxiv.org/pdf/1808.01244

CornerNet outperforms one-stage detectors on the MS COCO dataset with an Average Precision (AP) of 42.2%, making it competitive with two stage detectors.

References:

CornerNet: Detecting Objects as Paired Keypoints

We propose CornerNet, a new approach to object detection where we detect an object bounding box as a pair of keypoints…

arxiv.org

GitHub - princeton-vl/CornerNet

Contribute to princeton-vl/CornerNet development by creating an account on GitHub.

github.com

PaperSummary19 : CornerNet

CornerNet: Detecting Objects as Paired Keypoints

We propose CornerNet, a new approach to object detection where we detect an object bounding box as a pair of keypoints…

GitHub - princeton-vl/CornerNet

Contribute to princeton-vl/CornerNet development by creating an account on GitHub.

Written by Poonam Saini

No responses yet