Google-sponsored research helps ad-slingers find fanbois on social networks

'We noticed you were holding an iPhone at a friend's wedding, need any accessories?'


Next time you post an image of your new El Reg prosthetic beak on the internet, be aware that marketers will soon have a tool to let them spot it, isolate it, and figure out what it's pictured alongside.

In a paper to be delivered in Sydney today, academics from Carnegie Mellon University demonstrate a new approach to identifying brands in images and spotting their associations.

The "Inferring Brand Associations from Web Community Photos" paper goes beyond traditional text-based analysis tools and wades into the nascent field of image recognition and classification.

"With experiments on about five million of images of 48 brands crawled from five popular online photo sharing sites, we demonstrate that our approach can discover complementary views on the brand associations that are hardly mined by traditional text-based methods," they write.

The approach "can detect the regions of brand more successfully than other candidate methods," they say.

CMUImage

The CMU tech can identify the context in which brands appear, and isolate them

The tech has two key elements: one part of it helps isolate brands from their surroundings, such as picking out the golden arches of McDonalds that lurk in the background of a family photo, and a second component groups these isolated brands by associations.

To gather the data, the scientists trawled Flickr, Photobucket, DeviantArt, Pinterest, and Twitpic for images that had been labelled with one of 48 brands in either meta-data or tags.

They then tested these images against an algorithm containing as input a selection of images of each of the 48 brands, which helped them identify various "exemplars" — frequently depicted things containing the brand, such as famous football players.

"Given millions of images retrieved by the query Nike, we cluster the images into L number of groups using visual similarity between images. (L is given by a user, ex. L=1000)," CMU researcher and one of the paper's authors Gunhee Kim told El Reg by email. "The exemplar is the cluster center (i.e., most representative image of the cluster). If a cluster is larger (i.e., include more images), it and its exemplar will be ranked higher. (For example, if the Nike set contains 10,000 jogging images and 5,000 soccer images, jogging image will be higher ranked)."

With the exemplars, they can then create clusters of the most-depicted associations of a brand, and can also refine their algorithms for separating the branded elements out from a picture.

"Exemplars are used to visualize the brand association map as a concise but comprehensive set of key visual themes associated with brands," the researchers write. "Second, for brand localization, the exemplars are used as references to detect the most brand-like regions in clustered images."

Given enough data, the approach can give marketers a new way to figure out where their brands are appearing most frequently – valuable information for use in future advertising campaigns.

Other potential applications include "photo-based competitor mining," and "contextual advertising for images," according to documents received by El Reg from CMU researchers.

"As output of our work, we can discover the dominant image clusters of each brand and image regions that are most associated with the brand, from the pictures posted by millions of users. Therefore, given a novel image, we can identify which brands are most relevant to the whole image (or regions of the image). For example, for the group of luxury brands, if the new image includes a watch and a bag, we can first generate bounding boxes for the two regions, and retrieve the Rolex and Louis Vuitton as the most relevant advertisement, respectively," they write.

Carnegie Mellon University is one of the premiere institutions in America for developing computer-aided analysis of web content, and is frequently favored with grants from Google and the National Science Foundation for its oh-so-ad-friendly research.

Other projects that CMU faculty are working on include the "Never Ending Image Learner" (NEIL) tech, which scans the web and learns the associations of various images, and its predecessor the "Never-Ending Language Learner" (NELL).

Though these projects are all developing new techniques for figuring out the associations and patterns within and between large collections of images, they still depend on pre-determined data such as the tags of images. This is an area in which Google may be able to help, given that its own machine learning platform recently figured out how to identity paper shredders all by itself. ®

Similar topics

Broader topics


Other stories you might like

  • Monero-mining botnet targets Windows, Linux web servers
    Sysrv-K malware infects unpatched tin, Microsoft warns

    The latest variant of the Sysrv botnet malware is menacing Windows and Linux systems with an expanded list of vulnerabilities to exploit, according to Microsoft.

    The strain, which Microsoft's Security Intelligence team calls Sysrv-K, scans the internet for web servers that have security holes, such as path traversal, remote file disclosure, and arbitrary file download bugs, that can be exploited to infect the machines.

    The vulnerabilities, all of which have patches available, include flaws in WordPress plugins such as the recently uncovered remote code execution hole in the Spring Cloud Gateway software tracked as CVE-2022-22947 that Uncle Sam's CISA warned of this week.

    Continue reading
  • Red Hat Kubernetes security report finds people are the problem
    Puny human brains baffled by K8s complexity, leading to blunder fears

    Kubernetes, despite being widely regarded as an important technology by IT leaders, continues to pose problems for those deploying it. And the problem, apparently, is us.

    The open source container orchestration software, being used or evaluated by 96 per cent of organizations surveyed [PDF] last year by the Cloud Native Computing Foundation, has a reputation for complexity.

    Witness the sarcasm: "Kubernetes is so easy to use that a company devoted solely to troubleshooting issues with it has raised $67 million," quipped Corey Quinn, chief cloud economist at IT consultancy The Duckbill Group, in a Twitter post on Monday referencing investment in a startup called Komodor. And the consequences of the software's complication can be seen in the difficulties reported by those using it.

    Continue reading
  • Infosys skips government meeting – and collecting government taxes
    Tax portal wobbles, again

    Services giant Infosys has had a difficult week, with one of its flagship projects wobbling and India's government continuing to pressure it over labor practices.

    The wobbly projext is India's portal for filing Goods and Services Tax returns. According to India's Central Board of Indirect Taxes and Customs (CBIC), the IT services giant reported a "technical glitch" that meant auto-populated forms weren't ready for taxpayers. The company was directed to fix it and CBIC was faced with extending due dates for tax payments.

    Continue reading
  • Google keeps legacy G Suite alive and free for personal use
    Phew!

    Google has quietly dropped its demand that users of its free G Suite legacy edition cough up to continue enjoying custom email domains and cloudy productivity tools.

    This story starts in 2006 with the launch of “Google Apps for Your Domain”, a bundle of services that included email, a calendar, Google Talk, and a website building tool. Beta users were offered the service at no cost, complete with the ability to use a custom domain if users let Google handle their MX record.

    The service evolved over the years and added more services, and in 2020 Google rebranded its online productivity offering as “Workspace”. Beta users got most of the updated offerings at no cost.

    Continue reading
  • GNU Compiler Collection adds support for China's LoongArch CPU family
    MIPS...ish is on the march in the Middle Kingdom

    Version 12.1 of the GNU Compiler Collection (GCC) was released this month, and among its many changes is support for China's LoongArch processor architecture.

    The announcement of the release is here; the LoongArch port was accepted as recently as March.

    China's Academy of Sciences developed a family of MIPS-compatible microprocessors in the early 2000s. In 2010 the tech was spun out into a company callled Loongson Technology which today markets silicon under the brand "Godson". The company bills itself as working to develop technology that secures China and underpins its ability to innovate, a reflection of Beijing's believe that home-grown CPU architectures are critical to the nation's future.

    Continue reading
  • China’s COVID lockdowns bite e-commerce players
    CEO of e-tail market leader JD perhaps boldly points out wider economic impact of zero-virus stance

    The CEO of China’s top e-commerce company, JD, has pointed out the economic impact of China’s current COVID-19 lockdowns - and the news is not good.

    Speaking on the company’s Q1 2022 earnings call, JD Retail CEO Lei Xu said that the first two years of the COVID-19 pandemic had brought positive effects for many Chinese e-tailers as buyer behaviour shifted to online purchases.

    But Lei said the current lengthy and strict lockdowns in Shanghai and Beijing, plus shorter restrictions in other large cities, have started to bite all online businesses as well as their real-world counterparts.

    Continue reading
  • Foxconn forms JV to build chip fab in Malaysia
    Can't say when, where, nor price tag. Has promised 40k wafers a month at between 28nm and 40nm

    Taiwanese contract manufacturer to the stars Foxconn is to build a chip fabrication plant in Malaysia.

    The planned factory will emit 12-inch wafers, with process nodes ranging from 28 to 40nm, and will have a capacity of 40,000 wafers a month. By way of comparison, semiconductor-centric analyst house IC Insights rates global wafer capacity at 21 million a month, and Taiwanese TSMC’s four “gigafabs” can each crank out 250,000 wafers a month.

    In terms of production volume and technology, this Malaysian facility will not therefore catapult Foxconn into the ranks of leading chipmakers.

    Continue reading
  • NASA's InSight doomed as Mars dust coats solar panels
    The little lander that couldn't (any longer)

    The Martian InSight lander will no longer be able to function within months as dust continues to pile up on its solar panels, starving it of energy, NASA reported on Tuesday.

    Launched from Earth in 2018, the six-metre-wide machine's mission was sent to study the Red Planet below its surface. InSight is armed with a range of instruments, including a robotic arm, seismometer, and a soil temperature sensor. Astronomers figured the data would help them understand how the rocky cores of planets in the Solar System formed and evolved over time.

    "InSight has transformed our understanding of the interiors of rocky planets and set the stage for future missions," Lori Glaze, director of NASA's Planetary Science Division, said in a statement. "We can apply what we've learned about Mars' inner structure to Earth, the Moon, Venus, and even rocky planets in other solar systems."

    Continue reading

Biting the hand that feeds IT © 1998–2022