Doctor Google stitches eyes, ears onto its cloudy learning monster

Video recog, audio training data pave way to more intelligent machines


Google Next '17 Following on the heels of its image dataset dump last year to aid the training of image recognition models, Google has done the same for sound, even as it beefs up the visual machine learning capabilities available through its Cloud Platform.

At the Google Cloud Next '17 event on Wednesday, Fei-Fei Li, chief scientist of Google's cloud-based machine learning efforts, introduced the company's Cloud Video Intelligence API. Presently in private beta, the service allows developers to query video content using words that describe depicted objects and their characteristics.

This can already be done in a limited way with video that has been annotated with metadata or associated with closed-captioned descriptions.

But Google – by analyzing every video frame using image recognition algorithms and data models developed through machine learning – can provide a far more comprehensive set of data for finding things depicted in videos through its API.

Amazon, for what it's worth, launched something similar, called Rekognition, last year.

Google Cloud Machine Learning and Cloud Data Lab both entered general availability, which matters to enterprise customers. The Cloud Vision API 1.1 (still in beta) can now recognize entities from Google's Knowledge Graph and has gained OCR capabilities to better extract text from documents. And the Cloud Jobs API gained the ability to query employment opportunities based on commute times and preferred modes of transportation.

Also, Google has launched what it's calling its Advanced Solutions Lab, a facility for training enterprise customers to train their machines to learn.

Meanwhile, the Mountain View, California-based data hoarder has opened another section of its attic to reveal AudioSet, a collection of more than 2 million 10-second YouTube clips annotated with sound labels and categorized into more than 600 different classes.

The AudioSet ontology, or data model, describes the audio content in terms of event categories, which include human and animal sounds, musical instruments and genres, and environmental sounds.

For example, one JSON object described in the model is labelled "Chuckle, chortle." It includes a citation field that points to a text definition of the sound and it lists seven YouTube clips, with embedded start and end times, that present audible examples of chuckling.

Google's goal for this sound and fury is to help researchers and developers train machine learning models that can be made available to applications for identifying sounds.

Those working with images can already avail themselves of a variety of public data sets for model training. Music and speech datasets are also available, though there are fewer of them.

Now those looking to teach their software to identify, say, throat clearing or the rustling of leaves, don't have to resort to collecting their own training samples. ®

Broader topics


Other stories you might like

  • Cheers ransomware hits VMware ESXi systems
    Now we can say extortionware has jumped the shark

    Another ransomware strain is targeting VMware ESXi servers, which have been the focus of extortionists and other miscreants in recent months.

    ESXi, a bare-metal hypervisor used by a broad range of organizations throughout the world, has become the target of such ransomware families as LockBit, Hive, and RansomEXX. The ubiquitous use of the technology, and the size of some companies that use it has made it an efficient way for crooks to infect large numbers of virtualized systems and connected devices and equipment, according to researchers with Trend Micro.

    "ESXi is widely used in enterprise settings for server virtualization," Trend Micro noted in a write-up this week. "It is therefore a popular target for ransomware attacks … Compromising ESXi servers has been a scheme used by some notorious cybercriminal groups because it is a means to swiftly spread the ransomware to many devices."

    Continue reading
  • Twitter founder Dorsey beats hasty retweet from the board
    As shareholders sue the social network amid Elon Musk's takeover scramble

    Twitter has officially entered the post-Dorsey age: its founder and two-time CEO's board term expired Wednesday, marking the first time the social media company hasn't had him around in some capacity.

    Jack Dorsey announced his resignation as Twitter chief exec in November 2021, and passed the baton to Parag Agrawal while remaining on the board. Now that board term has ended, and Dorsey has stepped down as expected. Agrawal has taken Dorsey's board seat; Salesforce co-CEO Bret Taylor has assumed the role of Twitter's board chair. 

    In his resignation announcement, Dorsey – who co-founded and is CEO of Block (formerly Square) – said having founders leading the companies they created can be severely limiting for an organization and can serve as a single point of failure. "I believe it's critical a company can stand on its own, free of its founder's influence or direction," Dorsey said. He didn't respond to a request for further comment today. 

    Continue reading
  • Snowflake stock drops as some top customers cut usage
    You might say its valuation is melting away

    IPO darling Snowflake's share price took a beating in an already bearish market for tech stocks after filing weaker than expected financial guidance amid a slowdown in orders from some of its largest customers.

    For its first quarter of fiscal 2023, ended April 30, Snowflake's revenue grew 85 percent year-on-year to $422.4 million. The company made an operating loss of $188.8 million, albeit down from $205.6 million a year ago.

    Although surpassing revenue expectations, the cloud-based data warehousing business saw its valuation tumble 16 percent in extended trading on Wednesday. Its stock price dived from $133 apiece to $117 in after-hours trading, and today is cruising back at $127. That stumble arrived amid a general tech stock sell-off some observers said was overdue.

    Continue reading
  • Amazon investors nuke proposed ethics overhaul and say yes to $212m CEO pay
    Workplace safety, labor organizing, sustainability and, um, wage 'fairness' all struck down in vote

    Amazon CEO Andy Jassy's first shareholder meeting was a rousing success for Amazon leadership and Jassy's bank account. But for activist investors intent on making Amazon more open and transparent, it was nothing short of a disaster.

    While actual voting results haven't been released yet, Amazon general counsel David Zapolsky told Reuters that stock owners voted down fifteen shareholder resolutions addressing topics including workplace safety, labor organizing, sustainability, and pay fairness. Amazon's board recommended voting no on all of the proposals.

    Jassy and the board scored additional victories in the form of shareholder approval for board appointments, executive compensation and a 20-for-1 stock split. Jassy's executive compensation package, which is tied to Amazon stock price and mostly delivered as stock awards over a multi-year period, was $212 million in 2021. 

    Continue reading
  • Confirmed: Broadcom, VMware agree to $61b merger
    Unless anyone out there can make a better offer. Oh, Elon?

    Broadcom has confirmed it intends to acquire VMware in a deal that looks set to be worth $61 billion, if it goes ahead: the agreement provides for a “go-shop” provision under which the virtualization giant may solicit alternative offers.

    Rumors of the proposed merger emerged earlier this week, amid much speculation, but neither of the companies was prepared to comment on the deal before today, when it was disclosed that the boards of directors of both organizations have unanimously approved the agreement.

    Michael Dell and Silver Lake investors, which own just over half of the outstanding shares in VMware between both, have apparently signed support agreements to vote in favor of the transaction, so long as the VMware board continues to recommend the proposed transaction with chip designer Broadcom.

    Continue reading

Biting the hand that feeds IT © 1998–2022