Pose detection is a computer vision technique that can track and accurately predict the stance and position of a person’s body. Google’s ML Kit Pose Detection API enables you to do this in real time or from a static image, with a set of skeletal points. These points, known as landmarks, correspond to numerous body […]
Text recognition, commonly referred to as optical character recognition (OCR), is a widespread technology used for recognizing text in images or documents; in our case it is from a live camera feed. OCR has many uses in everyday life, such as automated data-entry, identification, invoice checking, document copying and handwriting recognition, amongst many other things. […]
ML Kit is a collection of API’s that can be used to integrate machine learning processing into your own applications. Including computer vision and natural language API’s, the package is tailored towards usability and ease of access, while maintaining most of the processing under the hood. It can work with the Camera X API, which […]
The use of facial recognition technology is rapidly gaining momentum. Already integrated in many aspects of our day-to-day lives, it is one of the most well-known and widely used applications of machine learning. Capable of comparing and matching a person’s face from a digital image or live feed against a database of collected faces, the […]