Research Guide for Video Frame Interpolation with Deep Learning

Articles

In this research guide, we’ll look at deep learning papers aimed at synthesizing video frames within an existing video. This could be in between video frames, known as interpolation, or after them, known as extrapolation.

The better part of this guide will cover interpolation. Interpolation is useful in software editing tools as well as in generating video animations. It can also be used to generate clear video frames in sections where a video is blurred.

Continue reading Research Guide for Video Frame Interpolation with Deep Learning

Optical Character Recognition Using Deep Learning Techniques

Articles

Optical character recognition (OCR) is a method that helps machines recognize texts. Traditional OCR uses patterns and correlation to differentiate words from other elements. However, these techniques don’t tend to produce results with high accuracy for complex text or in-motion streams. For these tasks, you’re better served using deep learning models. In that spirit, in this article we’ll explore three deep learning models for OCR.

Continue reading Optical Character Recognition Using Deep Learning Techniques

HAMR — 3D Hand Shape and Pose Estimation from a Single RGB Image

Articles

In recent years, research related to vision-based 3D image processing has become increasingly active, given its many applications in virtual reality (VR) and augmented reality (AR). Despite years of studies, however, there are still images that machines struggle to understand—one of those is images of human hands.

Hand image understanding targets the problem of recovering the spatial configuration of hands from natural RGB or/and depth images. This task has many applications, such as human-machine interaction and virtual/augmented reality, among others.

Continue reading HAMR — 3D Hand Shape and Pose Estimation from a Single RGB Image

Building a Vision-Controlled Car Using Raspberry Pi—From Scratch

Articles

In this tutorial, we’ll create a car that controlled using a USB camera and a Raspberry Pi. We’ll use a wood plank as the car body, which will hold all the car’s components. The components are:

After fixing the component using cable ties, we’ll connect the Raspberry Pito the motors using its general purpose input output (GPIO) pins. Similar to a traffic sign, the motors will run when the dominant color in the captured images is green and stop when it’s red.

Continue reading Building a Vision-Controlled Car Using Raspberry Pi—From Scratch

2019’s Top Machine Learning Articles

Articles

In this piece, we’ll take a closer look at some of the top machine learning articles in 2019, as ranked by MyBridge.

According to the article:

Pluribus was tested against professional poker players. This included two winners of the World Series of Poker Event. Pluribus uses self-play to teach itself how to win. This happens without any examples or guidance. The self-play version used in Pluribus is an improved version of the iterative Monte Carlo CFR (MCCFR) algorithm.

Continue reading 2019’s Top Machine Learning Articles

Four Important Computer Vision Annotation Tools you Need to Know in 2020

Articles

Computer vision is a field in artificial intelligence based on giving machines an understanding of data from images, video frames, pixel manipulations, and others. In this article, we will look into four commonly used annotation tools for object detection/tracking. To demonstrate these, we will be annotating team players on the football pitch.

The following is an outline on what we will be looking into:

Continue reading Four Important Computer Vision Annotation Tools you Need to Know in 2020

A 2019 Guide to Deep Learning-Based Image Compression

Articles

Compression involves processing an image to reduce its size so that it occupies less space. There are already codecs, such as JPEG and PNG, whose aim is to reduce image sizes. There are two types of image compression; lossy and lossless.

In lossless compression, one can retrieve the original image data, while in lossy compression one cannot. As a point of reference, PNG is lossless and JPEG is lossy.

Continue reading A 2019 Guide to Deep Learning-Based Image Compression