Jun 8, 2018
When Costanoa opened its doors in late 2012, the data revolution was underway. Thanks to the intersection of the internet, mobility and cloud computing, the ensuing information explosion generated unique opportunities for entrepreneurs to deliver products that create and transform industries.
I have seen a lot of changes in the start up landscape in the six years since we opened our doors (or arguably our laptops), but two things have remained true:
As we sometimes do, let’s start at the end.
Even if a team develops the best-in-class AI algorithm in the world for a particular business purpose, the intense competition in AI-driven applications means that they’ll likely have a fleeting advantage without creating a unique and proprietary data set. AI algorithms are only as good as the data they process, so the two are tightly linked in creating solutions that create value. The world has too many smart people to scrape data from the internet or from public data sets and believe you’ll build a big business; it will only be a matter of time before someone else does the same with a better/smarter algorithm or UX. You’ll end up in a commodity race because everyone else will be able to scrape the same data from the same data sets as you. Public data sources can be a good starting point but innovators must plan to aggregate, curate and originate in order to retain a long term sustainable advantage.
In this world of applied AI, one way or another, you’ve got to find yourself in the data pipeline. That can be done by plugging into existing systems, providing intermediate applications that engages users, or partnering with pilot customers.
Guardian Analytics prevents fraud for banking transactions by integrating into the core banking platform, such as Fiserv or Bottomline. The bank is in the system already so using available APIs to integrate directly and train as transactions let them prevent more fraud with fewer false positives than traditional rules-based system. This approach often takes time since older systems may not have well documented APIs, but Guardian has been able to plug into those systems for a while. Now that it is integrated into 400+ financial institutions and has evaluated billions of transactions, it has an enduring advantage that will be hard to replicate.
Alation, the leading provider of Enterprise Data Catalogs (according to Gartner and Forrester) uses machine learning to help users find, understand and govern enterprise data. They built a SQL query tool that made it easier for them to see and operate on the data to answer questions like: what does this field mean, are HP and Hewlett Packard are the same thing in this query, who uses this field most frequently and in what Tableau reports is this field used.
Focal Systems uses computer vision and deep learning to improve customer experience and reduce costs in brick-and-mortar retail. Their goal is to eliminate “out of stocks” and deliver a frictionless self checkout solution. They began in local grocers deploying carts equipped with “Wayfinding Tablets.” As the tablets direct customers to specific items in a store, the devices take pictures in different light and angles and builds up the data in pursuit of delivering a reliable out of stock solution that meets grocer requirements. Full deployment of the out of stock solution similarly helps capture the variety and volume of images necessary to deliver self checkout as good or better as the Amazon Go store with much lower cost and available to everybody.
Aquabyte decided to partner closely with a handful of key pilot customers. The company built initial algorithms based on fish in test tanks and used that to reach pilot agreements with several Norwegian salmon farms. They are using these partnerships to get more advanced cameras into the water that capture the data required to deliver the first use case — detection of sea lice — in the real world. The basic pitch was, “Let’s get into production so we can solve an important problem today, and train our algorithms on data so we can solve even bigger problems over time.”
These are all companies using applied AI to solve real-world — often vertical — problems. But it is their aggregation of proprietary data that will ultimately allow them to derive useful and actionable insights that create a defensible moat of data assets that nobody else has.