As robotics booms, startups race to capture real-world AI training data
Chennai: Home services startup Pronto’s admission last week that it was piloting in-home video recordings to train physical AI systems shines a light on a fast-growing and loosely regulated industry of AI data capture and labelling for the global robotics supply chain.
Pronto is not alone. Startups such as Human Archive, Humyn Labs, Egolab AI and Neocambrian are collecting what is called egocentric data: first-person video captured through wearables or head-mounted cameras. They partner with cloud kitchens, hotels, home services platforms, small textile and garment factories, and warehouse operators to record everyday tasks, from cooking meals and washing dishes to stitching garments, assembling components and sorting inventory. In some cases, startups have built dedicated ‘data factories’ with motion-tracking rigs.
“Typical clients are robotics, vision-language-action model and world model companies,” said Abhinav Kukreja, founder of Neocambrian AI, which raised funds from angels, including Dalmia Family Office Trust. “There is no equivalent repository of physical behaviour on the internet. Robots need to learn from messy homes, crowded factories, small shops and repair stations, which India offers. When done right, it can become an additional source of paid work for many workers and households, and we compensate both environment owners and data collectors,” he said.
This data trains world models and physical AI systems, teaching robots to navigate and act in messy, unstructured environments. It is also used to train smart glasses for object recognition. One industry insider said there is significant demand from the defence industry, particularly for autonomous drone applications. The practice also raises questions about privacy, legality and compensation, as in some cases videos are recorded without paying the workers or obtaining their consent. TOI learnt that some factories have paused such pilots after the recent backlash.
Manish Agarwal, co-founder of Humyn Labs, which works with leading frontier labs, said demand is growing from robotics OEMs, software makers and enterprises. “We collect and convert this into episodic strings for robot memory, which helps build low to mid-level agentic capabilities including physical action, voice, sight and mobility,” he said. “We are using verified networks of workers across 16 countries as robots cannot be trained only in Indian environments. For European domestic robotics to navigate better, we need training data similar to that environment,” he added.
Startups argue that this is India’s entry into the global AI value chain, and that working with frontier labs could help the country train competitive models of its own in the future. But sceptics see a familiar cost-arbitrage play. Madhukar Yarra, CEO of Bengaluru BPO NextWealth, which annotates these videos, called it a flash in the pan. Much of the data is collected through unorganised gig work, he said.
Sangeeta Gupta, SVP at Nasscom, said physical AI data could diversify India’s AI services beyond traditional data labelling. “But issues around informed consent, anonymisation, worker awareness and ethical use will require continued industry responsibility and evolving safeguards,” she said.