Machine learning requires large amounts of data to develop predictive models of the world. Help foster the development of machine learning by sharing or selling datasets you have developed, or acquire new datasets to jumpstart your next project.
Knuckle Head Corporation is offering an OCR image dataset covering several industries, such as hotels, cab rentals, and bars.
A dataset of one million OCR images covering several industries, such as hotels, cab rentals, and bars. Every invoice is a high-quality image taken with a smartphone. The invoices cover businesses in the USA and India.
There are three types of invoice images, grouped by lighting condition (well lit, low light, and shadow). The invoices were photographed both indoors and outdoors against different backgrounds.
A legal clause classification dataset built from various sources, such as multiple contracts and online contract texts, and labeled into 24 categories. The sole purpose of this dataset is to classify any given contract text into one of the clause labels. The predefined categories can be customized according to user requirements, and the same goes for the contract text in the dataset.
Twitch stream data collected from ~2500 popular Twitch streamers over 4 months (9/24/2020 to 2/05/2021). Real-time data for live streamers, updated approximately every 5 minutes. The dataset includes the current timestamp, streamer name, stream title, game_id, stream start time, and viewership count. Contains 7,936,251 live stream data instances.
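As a rough illustration of how one record from the Twitch dataset might be represented, here is a minimal Python sketch. The listing names the fields but not the file format, so everything else here is an assumption: the class name `StreamSnapshot`, the helper `parse_snapshot`, the dictionary keys (other than `game_id`), and the ISO 8601 timestamp strings are all hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class StreamSnapshot:
    """One polled observation of a live stream (hypothetical field names)."""
    timestamp: datetime          # when the snapshot was taken
    streamer_name: str
    stream_title: str
    game_id: str
    stream_start_time: datetime  # when the stream went live
    viewer_count: int

    @property
    def minutes_live(self) -> float:
        """Minutes between stream start and this snapshot."""
        delta = self.timestamp - self.stream_start_time
        return delta.total_seconds() / 60.0


def parse_snapshot(row: dict) -> StreamSnapshot:
    """Build a snapshot from a dict of strings (assumed ISO 8601 timestamps)."""
    return StreamSnapshot(
        timestamp=datetime.fromisoformat(row["timestamp"]),
        streamer_name=row["streamer_name"],
        stream_title=row["stream_title"],
        game_id=row["game_id"],
        stream_start_time=datetime.fromisoformat(row["stream_start_time"]),
        viewer_count=int(row["viewer_count"]),
    )


# Made-up example row, not taken from the dataset.
sample = parse_snapshot({
    "timestamp": "2020-09-24T19:05:00",
    "streamer_name": "example_streamer",
    "stream_title": "Example title",
    "game_id": "12345",
    "stream_start_time": "2020-09-24T18:00:00",
    "viewer_count": "1500",
})
```

Because snapshots are taken roughly every 5 minutes, a single stream would appear as many consecutive rows, so grouping by streamer name and stream start time is one plausible way to reassemble individual streams.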