Many organizations in recent years have been taking advantage of the power of machine learning (ML). From natural language processing (NLP) to image recognition, fields from advertising to zoology have benefitted from the still-emerging power of ML. Humans are getting better at making sense from a glut of data. Recent technical advances have improved our ability to clean, engineer, and label our data. New tools have also helped us design and train ML models.
While these tools have helped data engineers and data scientists accelerate their work, most tools only act as components of the machine learning pipeline. Data scientists…
A general evaluation of ALPR systems for commercial models.
Anno.Ai’s data science team often conducts benchmark assessments of commercial and open-source AI/ML models to gauge their fit for our customers’ use cases (for example, see our assessments of handwritten text recognition (HTR) models and named entity recognition (NER) models here and here). Recently we looked at commercial providers of automatic license plate recognition (ALPR) models. Many of our customers have large bodies of unstructured data that include images and videos that are used for post-event analysis. Our customers often require the ability to triage their data quickly to identify objects…
An evaluation of Named Entity Recognition models for commercial NLP offerings.
As part of our series on AI/ML model evaluations, the Anno.Ai data science team delved into the world of Natural Language Processing (NLP). Many of our customers have NLP needs, so we decided to explore a variety of online and offline NLP libraries and services. For this task our data science team looked into Named Entity Recognition (NER) and this article is specifically focused on online vendors.
Given a collection of documents or other unstructured text, it is useful to be able to identify and extract information that falls…
CVPR 21 Highlights
Person re-identification (PreID) is the task of associating images or video frames of the same person taken from different cameras or from the same camera at different times. PreID has a number of applications, including automated contact tracing, missing child detection, and real-time athlete statistics summaries at sporting events. PreID can be categorized along a number of dimensions, including: level of supervision, source, and modality. In this article, we consider unsupervised or weakly supervised, multi-source, unimodal PreID specifically (and drop the adjectives at this point).
IEEE’s CVPR conference is one of the most popular among the computer vision community. 1600 papers were accepted to CVPR21, showcasing a number of impressive advances over the past year. We highlight eight exceptional papers that should be read, with code that should be run, when your busy schedule permits it.
As previously published by Amina Al Sherif
As people occasionally do, I was asked to attend a conference that involved a lot of listening and not much speaking, which is generally what happens when your speaker list consists of more than thirty individuals on a Zoom call.
Despite my lack of ability to speak at said conference, I did think the questions posed were worthwhile. As I settle into my new role as Chief Data Ethicist at Anno.Ai, I realize now more than ever that data ethics is extremely closely tied with subjects on AI security.
I will present my…
Labeling images in preparation for neural network training can be time-consuming and tedious, especially when working with polygons. The most common tools available require the user to place multiple points around the object they wish to segment. These tools either draw straight lines from point to point or try their best to snap the line around the edges of the object. Sometimes these tools require 10 or more points to create a decent segmentation of the object.
… and a comparison with cloud provider translation services
As we discussed in our previous post, the Anno.Ai data science team has continued evaluating machine learning model providers by testing machine translation offerings. In Part 1, we compared the Google Cloud Platform (GCP), Amazon Web Services (AWS), and Microsoft Azure APIs for translating Arabic, Chinese, Persian, and Russian into English.
While the commercial cloud services provide a great option for online use cases, some of our use cases require running models in an offline environment and/or the flexibility to re-train and tune these models to more specific data environments. For…
Anno.Ai’s data science team has continued evaluating online machine learning model providers by testing machine translation offerings. This evaluation follows our previous benchmarking studies of handwriting recognition, named entity recognition, and automatic license plate recognition providers.
For this exercise, we envisioned a customer who needs to quickly grasp the meaning, tone, and intent of short text blocks. The type of text might range from a formal document to more colloquial communications containing idioms, abbreviations, or social media-style hashtags. This customer may not need publication-ready translated output, but understanding nuance and context from a variety of speaker/writer styles is key.
A quick start guide to version control for machine learning data
As part of a larger effort to test and evaluate different MLOps frameworks, the data science team at Anno.Ai recently tested out DVC to improve integration between our model repos on GitHub and our data and model storage on Amazon S3. In this article, we provide a quick guide to getting set up with DVC and some tips we learned along the way.
DVC (Data Version Control) is an open-source application for machine learning project version control — think Git for data. …