Documents
Fast and accurate annotation of invoices and bills of lading amongst other document types. A team of annotators was hired to work on a permanent basis, facilitating a regular and reliable flow of daily document annotation.
Annotation of documents requiring specialised knowledge, including shipping documents. Team was trained by a lead with industry knowledge, along with client guidance and expertise.
Financials
Annotation of quarterly and annual reports of companies and investment funds. Annotators gained in-depth knowledge of financial terms and financial reporting structure. Extraction of company information from report notes section and proof-reading of internal financial reports.
Extensive dataset construction of UK-based private equity funds and their portfolio companies, including portfolio company information. Annotators collaborated on a Google Docs file, which was handed over to client upon full coverage of the private equity industry.
Reclassification of more than 10,000 UK-based companies into industries using Companies' House information and SIC codes. Annotators were tasked with reading companies' descriptions, attributing sub-sector and sector keywords for the purpose of building a training set for company-sector classification.
Images
Annotation of quarterly and annual reports of companies and investment funds. Annotators gained in-depth knowledge of financial terms and financial reporting structure. Extraction of company information from report notes section and proof-reading of internal financial reports.
Locating and labelling of bar codes on goods sold in supermarkets while double-checking, overriding and making corrections to the model's results if necessary.
Web-based
Extraction and classification of tweets and comments into different categories based on whether the comment agrees or disagrees with its related tweet. Annotators tasked with understanding comments and making subjective but consistent judgements

Website annotation and classification of text on company homepages into categories including industry, business activity, product/service etc. Consistency and understanding of context was vital to ensure good quality of data.
Extraction of company LinkedIn data and information on companies' use of LinkedIn, Twitter and other plug-ins in company websites using tools such as Ghostery
