
Smart cameras are to be found in millions of homes, but the truth is theyre not all that smart.
Facial recognition and motion detection are their main tricks but what if you want to know if the dog jumped on the couch, or if your toddler is playing with the stove? Visual One equips cameras with the intellect to understand a bit more of the world and give you more granular and important information.Founder Mohammad Rafiee said that the idea came to him after he got a puppy (Zula) and was dissatisfied with the options he had for monitoring her activities while he was away.
Here she is doing what dogs do best:There are no bad dogs, but chairs are for peopleThere were specific things I wanted to know were happening, like I wanted to check if the dog got picked up by the dog walker.
The cameras motion detection is useless shes always moving, he lamented.
In fact, with a lot of these cameras, just a change in the lighting or wind or rain can trigger the motion alert, so its completely impractical.My background is in machine learning.
I was thinking about it, and realized were at a stage where this problem is starting to become solvable, he continued.Some tasks in computer vision, indeed, are as good as solved detecting faces and common objects such as cars and bikes can be done quickly and efficiently.
But thats not always useful whats the point of knowing someone rode their bike past your house? In order for this to have value, the objects need to be understood as part of a greater context, and thats what Rafiee and Visual One are undertaking.Unfortunately, its far from easy or else everyone would be doing it already.
Identifying a cat is simple, and identifying a table is simple, but identifying a cat on a table is surprisingly hard.Its a very difficult problem.
So were breaking it down to things we can solve right now, then building on that, Rafiee explained.
With deep learning techniques we can identify different objects, and we build models on top of those to specify different interactions, or specific objects being in specific locations.
Like a car in the wrong spot, or a dog getting on a couch.
We can recognize that with high accuracy right now we have a list of supported objects and models that were expanding.In case youre not convinced that the capabilities are that much advanced from the usual activity in the living room or Kendra is at the front door notifications, here are a few situations that Visual One is set up to detect:Kid playing with the stoveToddler climbing furnitureKid holding a knifeBaby left alone for too longRaccoon getting into garbageElderly person taking her medicationsElderly person in bed for too longCar parked in the wrong spotGarage door left openDog chewing on a shoeCat scratching the furnitureThe process for creating these triggers is pretty straightforwardIf one of those doesnt make you think actually that would be really good to know, then perhaps a basic security camera is enough for your purposes after all.
Not everyone has a knife-curious toddler.
But those of you who do are probably scrolling furiously past this paragraph looking for where to buy one of these things.Unfortunately Visual One isnt something you can just install on any old existing system with the prominent exception of Nest, into which it can plug.
Camera workflows are generally too locked down for security and privacy purposes to allow for third-party apps and services to be slipped in.
But the company isnt trying to bankrupt everyone with an ultra-luxury offering.
Its using off-the-shelf cameras from Wyze and loading them with its own software stack.Rafiee said he pictures Visual One as a mid-tier option for people who want to have more than a basic camera setup but arent convinced by the more expensive plays.
That way the company avoids going head-on with commodity hardwares race to the bottom or the brand warfare taking place between Google and Amazons Nest and Ring.
Cameras cost $30-$40, and the service is $7 per month currently.Ultimately the low-end companies may want to license from Visual One, while the high-end companies will be developing their own full stack at great cost, making it difficult for them to go downmarket.
Hardware is hard, and AI is specialized unless youre a giant company its hard to do both.
I think we can fill the gap in the market for mid-market companies without those resources, he said.Of course privacy is paramount as well, and Rafiee said that because of the way their system works, although the AI lives in the cloud and therefore requires the cameras to be online (like most others), no important user data needs to be or will be stored on Visual One servers.
We do inference in the cloud so we can be hardware agnostic, but we dont need to store any data.
So we dont add any risk, he said.Visual One is launching today (after a stint in YCs latest cohort) with an initial set of objects and interactions, and will continue developing more as it observes which use cases prove popular and effective.