DeepProfiling using Clickstream Logs


The advertising industry has changed drastically as new technologies and platforms have emerged. Online purchasing has become far more prevalent, so advertisers need to be meticulous about reaching the right audience effectively and creatively.

Marketers invest a lot in advertising their business online, but they often do not receive the expected benefits. Why so? Let's take a simple example. Say you have a business selling baseball bats. The right audience for your ad would be males between the ages of 10 and 35, and the right place to advertise would be somewhere the context is about sports. Now, what if someone searches for bats (the animals) and your ad shows up? Or it gets displayed on a website about cosmetics? That doesn't make sense at all, right? All your money and effort go in vain. Acquiring your digital data and using it in the right way has therefore become vital to knowing who your most valuable customers are. Research has shown that digital targeting significantly improves the response to advertisements.

These are some of the points to take care of in online advertising:

  • Ads should match topics and context relevant to the business.
  • Real-time data on interested customers should be tracked.
  • The right consumers should be targeted by their demographics.

One of our clients wanted to use their clickstream data to suggest to advertisers the right channels for reaching their customers. They also wanted to track which ads attracted the most impressions and clicks, so that media budgets could be planned and allocated accordingly.

My team crawled a very large number of websites to extract content for training the model. The crawled data contained a great deal of unusable text surrounding the real content of each web page, so the first job was to clean up that junk and extract the core content of the page. The cleaned content was then tagged with one of 32 IAB categories and, within those, one of more than 200 subcategories. A classification model was trained on the tagged data using Spark and put into production.
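To make the clean-up step more concrete, here is a minimal sketch of one way to strip junk from a crawled page. It is not the pipeline we ran in production: it uses only Python's standard `html.parser`, and the tag lists, the `min_words` threshold, and the names `CoreTextExtractor` and `extract_core_text` are assumptions chosen for illustration.

```python
from html.parser import HTMLParser

# Assumption: text inside these tags is treated as junk in this sketch.
SKIP_TAGS = {"script", "style", "nav", "header", "footer", "aside", "form", "noscript"}
# Assumption: these tags mark the boundary between blocks of text.
BLOCK_TAGS = {"p", "div", "article", "section", "li", "h1", "h2", "h3", "td"}


class CoreTextExtractor(HTMLParser):
    """Collects text blocks that sit outside junk tags and are long enough."""

    def __init__(self, min_words=5):
        super().__init__()
        self.min_words = min_words   # hypothetical threshold for "real" content
        self._skip_depth = 0         # > 0 while inside any SKIP_TAGS element
        self._buffer = []            # text fragments of the current block
        self.blocks = []             # blocks kept so far

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1
        elif tag in BLOCK_TAGS:
            self.flush()

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1
        elif tag in BLOCK_TAGS:
            self.flush()

    def handle_data(self, data):
        # Ignore any text that appears inside a junk element.
        if self._skip_depth == 0:
            self._buffer.append(data)

    def flush(self):
        # Normalise whitespace, then keep the block only if it is long enough.
        text = " ".join(" ".join(self._buffer).split())
        self._buffer = []
        if len(text.split()) >= self.min_words:
            self.blocks.append(text)


def extract_core_text(html, min_words=5):
    """Return the kept text blocks of an HTML page, one block per line."""
    parser = CoreTextExtractor(min_words)
    parser.feed(html)
    parser.close()
    parser.flush()  # keep any trailing text after the last block tag
    return "\n".join(parser.blocks)
```

Calling `extract_core_text(page_html)` on a crawled page returns only the longer text blocks found outside navigation, scripts, and footers. A heuristic this simple will miss content on many real sites, so treat it as a starting point rather than a finished extractor.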

The real-time clickstream data was then captured by our trackers placed on the advertiser's website, with Kafka used to stream the data live. User profiling was based on the semantics of the page content, which were captured through word embeddings and the classifier described above. Profiling drew on various data points, such as age, gender, named entities (extracted with Named Entity Recognition, NER), location, purchase intent, and vertical preferences, to segment and target customers on different parameters. Translation APIs were used to convert content in other languages to English. Finally, the Kafka data, augmented with the derived data points, was pushed into Cassandra to target these users and provide insights to the advertisers.
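As a rough illustration of the profiling step, the sketch below aggregates classified click events into a per-user profile. It is a simplified stand-in, not our production code: Kafka consumption and the Cassandra write are omitted, a toy keyword matcher replaces the Spark classifier and word embeddings, and the field names (`user_id`, `page_text`), the category labels, and the functions `keyword_classify` and `build_profiles` are all hypothetical.

```python
from collections import Counter, defaultdict

# Assumption: a toy keyword lookup standing in for the trained classifier.
KEYWORDS = {
    "Sports": {"baseball", "bat", "league"},
    "Style & Fashion": {"lipstick", "cosmetics"},
}


def keyword_classify(text):
    """Return the category with the most keyword hits, or 'Uncategorized'."""
    words = set(text.lower().split())
    best = max(KEYWORDS, key=lambda category: len(words & KEYWORDS[category]))
    return best if words & KEYWORDS[best] else "Uncategorized"


def build_profiles(events, classify):
    """Aggregate click events into one profile per user.

    Each event is a dict with hypothetical keys 'user_id' and 'page_text'.
    """
    counts = defaultdict(Counter)
    for event in events:
        category = classify(event["page_text"])
        counts[event["user_id"]][category] += 1

    profiles = {}
    for user_id, counter in counts.items():
        total = sum(counter.values())
        top_category, top_hits = counter.most_common(1)[0]
        profiles[user_id] = {
            "top_category": top_category,   # most visited content category
            "affinity": top_hits / total,   # share of events in that category
            "events": total,                # number of events seen for the user
        }
    return profiles
```

In a setup like ours, each resulting profile would be one of the derived data points attached to the event stream before it is stored. A user whose clicks mostly land on sports pages would end up with `top_category` set to `"Sports"`, which is the kind of signal an advertiser selling baseball bats is looking for.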

We helped the client uncover insights that let them provide data-driven reports to advertisers, so that advertisers could invest wisely in ads and attract the right traffic. Analyzing the data effectively helped the client double their profits.
