Visual search tools are building a bridge

September 9, 2019

Retailers and brands continuously think about ways to improve the shopping experience and eliminate all possible friction before actual purchase. With visual search tools, they have a new instrument at their disposal. Similar to voice search, it eases and quickens the search process. However, as visual search tools align the unique characteristics of the physical world with the frictionless character of the digital world, they have more to offer than simply a new easy and quick way of shopping.

Our observations

  • When people undertake a visual search, they look for a product using an image or photo instead of keywords. This can be done in several ways. People can use their smartphone camera or upload a photo to a retailer. Instantly, the retailer will show the exact item for immediate purchase or similar items that match the characteristics of the product in the photo (e.g. color, style, collection, pattern, fitting, etc.). Currently, brands are primarily using it to improve product discovery and provide instant gratification.
  • Visual search has been here for some time. Google launched Google Image in 2011, but it was not until 2017 that visual search picked up pace. When Pinterest launched its visual discovery tool Lens, scanning images quickly became mainstream. It marked the beginning of a boom in visual search. After one year, Pinterest noted Lens had been used to complete over 600 million visual searches every month. In the past two years, important competitors have brought their visual search to the market or updated it. For example, the updated Google Lens is far more sophisticated than the earlier Google Image. Amazon recently implemented its new feature “style app”, letting users replicate their favorite fashion items by uploading a photo. Moreover, last year, it launched a partnership with Snapchat to compete with Pinterest and Instagram in the social commerce area. Snapchat users using the new visual search feature are directly linked to Amazon’s website.
  • There are strong signs visual search tools are here to stay. According to Marketwatch, 62% of Gen Z and millennials want visual search tools, more than any other new technology. And brands experimenting with visual search tools reported increased conversion rates.
  • In the past two decades, Google was nearly the only online entry point for textual product search. Last year became a landmark, with Amazon ending the long streak of Google’s online product search dominance. According to research, nowadays, more people in the U.S. use Amazon to start their product search. This marks another win in Amazon ecommerce dominance in the US. The competition between the two in the area of search is heading for unprecedented heights, as they’re also competitors in another important search domain: smart home speakers.

Connecting the dots

In the past, people went to a store and looked around to find new products. Maybe they would ask a store clerk, who would help them find the desired product. Visual and voice search coexisted peacefully in our average product discovery. Digitization has brought interesting changes to the way we search for products. The rise of e-commerce caused the unbundling of search and entering keywords (text search) became the new normal. Although most shopping is still done in physical locations, 87% of all product searches begin with digital channels. Conducting online pre-purchase research is common for almost all products, with groceries as the main exception. For around two decades, text search has dominated this online landscape. For the last couple of years, search behavior in retail has been slowly changing. In addition to text search, voice assistants have become a customer touchpoint for product search and product information. It started with mobile assistants such as Siri and now smart home speakers are starting to cleverly integrate the shopping experience into daily habits (e.g. purchase new groceries while cooking, order new socks while dressing). Compared to text search, we interact with voice search engines in a more conversational way. In this regard, they seem to function as a sort of virtual store clerk (e.g. asking if store X has a bigger size, whether there are discounts or what the openings hours are, etc.). Despite these differences between text and voice search, the underlying principle is the same: language.
In the past two years, visual search has slowly entered the stage. In a certain way, we could understand visual search as an extension of text search and voice search, reducing even more friction in the customer journey by reducing the time between seeing and buying. However, this could be misleading, as we would be overlooking the fundamental difference between visual search and text/voice search.
People undertaking a visual search don’t have to know a name or word, they only need to recognize or identify an object. Therefore, the different forms of search create disparate user experiences. Its nonverbal nature makes visual search more intuitive and inclusive. As we have written before, images are able to communicate rich meaning in split seconds and show us nuances that are difficult to capture in keywords. How to describe an item or a style is often difficult, causing a lot of friction in online product discovery. Visual search “returns” to immediate shopping experiences, common before the digital era, by eliminating mediation through language. However, unlike before, visual search is now linked to a digital world.

Let’s begin with the downside of this disintermediation of language. In addition to lacking the precision and clarity of language, visual search is limited to objects “here and now”. Making an abstraction of concepts such as “the weather” is difficult, if not impossible (e.g. pointing at a cloud and hoping for a weather forecast will leave us disappointed, as the search will necessarily lead to an examination of this cloud by computer vision).
However, there is also an upside to this. If we dive a little deeper into the “here and now” characteristic of visual search, we’ll find some interesting qualities. Customer journeys often begin with things that attract us for reasons we don’t know and that aren’t easily captured in phrases. In the physical realm, we are continuously inspired, attracted by certain items and moved by people we look up to. However, the brick-and-mortar of the physical world is bounded in its capacity to respond to this. Stores are limited in flexibility and personalization, and finite in their supply and offerings. Social media has been proven to copy some physical social dynamics to digital words. Not being subject to the boundaries of the physical world, social commerce on platforms such as Pinterest and Instagram seem to be the logical outcome of this. However, the online competition for our attention is fierce, especially when it comes to smaller brands. Moreover, digital worlds often exhibit herd behavior and social media influencers are criticized for a lack of authenticity. Ecommerce and social commerce seem unable to fully replicate the unique qualities of the physical world. This incongruity of both worlds has been a challenge to retailers for years. Visual search tools might have some answers to this. More than text search and voice search, visual search tools create a convenient and natural “bridge” between the finite and unique “here and now” and the frictionless and infinite digital realm. With visual search tools, the distance between physical inspiration and digital gratification becomes almost nonexistent. Every point in the physical world becomes a possible starting point for the customer journey.

Implications

  • Recently, POLITCO obtained a draft plan in which EU officials want a €100 billion wealth fund to bolster “European champions” against American and Chinese business rivals, as well as set more stringent regulation for tech companies and measures against strategic takeovers. Last year, we forecast that, facing the risk of being left behind, Europe is in search of its own European champions, and combined with strategic positioning on the decentral Stack, the EU should redefine its investment strategy. For example, in public-private partnerships in healthcare, public distributed ledgers, internet architecture, the. sharing economy.
  • As the social contract of big business – and big tech in particular – is under increasing pressure, we can expect these companies to take more social and ethical responsibility for the negative externalities their products and services generate. Examples of this are already emerging, such as Microsoft spending $500 million on “affordable housing” in Seattle, Uber sharing its data with city planners to help tackle mobility problems, Facebook setting up “war rooms” to monitor fake and extremist news on their platforms at times of important political events, or Airbnb setting up offices to reduce “mass tourism” around certain hotspots and popular destinations.

Series 'AI Metaphors'

×
1. The tool
Category: the object
Humans shape tools.

We make them part of our body while we melt their essence with our intentions. They require some finesse to use but they never fool us or trick us. Humans use tools, tools never use humans.

We are the masters determining their course, integrating them gracefully into the minutiae of our everyday lives. Immovable and unyielding, they remain reliant on our guidance, devoid of desire and intent, they remain exactly where we leave them, their functionality unchanging over time.

We retain the ultimate authority, able to discard them at will or, in today's context, simply power them down. Though they may occasionally foster irritation, largely they stand steadfast, loyal allies in our daily toils.

Thus we place our faith in tools, acknowledging that they are mere reflections of our own capabilities. In them, there is no entity to venerate or fault but ourselves, for they are but inert extensions of our own being, inanimate and steadfast, awaiting our command.
Read the article
×
2. The machine
Category: the object
Unlike a mere tool, the machine does not need the guidance of our hand, operating autonomously through its intricate network of gears and wheels. It achieves feats of motion that surpass the wildest human imaginations, harboring a power reminiscent of a cavalry of horses. Though it demands maintenance to replace broken parts and fix malfunctions, it mostly acts independently, allowing us to retreat and become mere observers to its diligent performance. We interact with it through buttons and handles, guiding its operations with minor adjustments and feedback as it works tirelessly. Embodying relentless purpose, laboring in a cycle of infinite repetition, the machine is a testament to human ingenuity manifested in metal and motion.
Read the article
×
3. The robot
Category: the object
There it stands, propelled by artificial limbs, boasting a torso, a pair of arms, and a lustrous metallic head. It approaches with a deliberate pace, the LED bulbs that mimic eyes fixating on me, inquiring gently if there lies any task within its capacity that it may undertake on my behalf. Whether to rid my living space of dust or to fetch me a chilled beverage, this never complaining attendant stands ready, devoid of grievances and ever-willing to assist. Its presence offers a reservoir of possibilities; a font of information to quell my curiosities, a silent companion in moments of solitude, embodying a spectrum of roles — confidant, servant, companion, and perhaps even a paramour. The modern robot, it seems, transcends categorizations, embracing a myriad of identities in its service to the contemporary individual.
Read the article
×
4. Intelligence
Category: the object
We sit together in a quiet interrogation room. My questions, varied and abundant, flow ceaselessly, weaving from abstract math problems to concrete realities of daily life, a labyrinthine inquiry designed to outsmart the ‘thing’ before me. Yet, with each probe, it responds with humanlike insight, echoing empathy and kindred spirit in its words. As the dialogue deepens, my approach softens, reverence replacing casual engagement as I ponder the appropriate pronoun for this ‘entity’ that seems to transcend its mechanical origin. It is then, in this delicate interplay of exchanging words, that an unprecedented connection takes root that stirs an intense doubt on my side, am I truly having a dia-logos? Do I encounter intelligence in front of me?
Read the article
×
5. The medium
Category: the object
When we cross a landscape by train and look outside, our gaze involuntarily sweeps across the scenery, unable to anchor on any fixed point. Our expression looks dull, and we might appear glassy-eyed, as if our eyes have lost their function. Time passes by. Then our attention diverts to the mobile in hand, and suddenly our eyes light up, energized by the visual cues of short videos, while our thumbs navigate us through the stream of content. The daze transforms, bringing a heady rush of excitement with every swipe, pulling us from a state of meditative trance to a state of eager consumption. But this flow is pierced by the sudden ring of a call, snapping us again to a different kind of focus. We plug in our earbuds, intermittently shutting our eyes, as we withdraw further from the immediate physical space, venturing into a digital auditory world. Moments pass in immersed conversation before we resurface, hanging up and rediscovering the room we've left behind. In this cycle of transitory focus, it is evident that the medium, indeed, is the message.
Read the article
×
6. The artisan
Category: the human
The razor-sharp knife rests effortlessly in one hand, while the other orchestrates with poised assurance, steering clear of the unforgiving edge. The chef moves with liquid grace, with fluid and swift movements the ingredients yield to his expertise. Each gesture flows into the next, guided by intuition honed through countless repetitions. He knows what is necessary, how the ingredients will respond to his hand and which path to follow, but the process is never exactly the same, no dish is ever truly identical. While his technique is impeccable, minute variation and the pursuit of perfection are always in play. Here, in the subtle play of steel and flesh, a master chef crafts not just a dish, but art. We're witnessing an artisan at work.
Read the article

About the author(s)

Economist and philosopher Sebastiaan Crul writes articles on a wide range of topics, including rule of law in digital societies, the virtualization of the lifeworld and internet culture. He is currently working on his doctoral degree on the influence of digitalization on mental health and virtue ethics, having previously published dissertations on the philosophy of play and systemic risks in the finance industry.

You may also like