
Visualize Website Clickstream Data




Your home page looks great. But how do you move customers on to bigger things – like submitting a form or completing a purchase? Get more granular with customer segmentation. Hadoop makes it easier to analyze, visualize and ultimately change how visitors behave on your website.

This tutorial covers an established use case for Hadoop: deriving insights from large data sources such as web logs. By combining web logs with more traditional customer data, we can better understand customers and learn how to optimize future promotions and advertising. We demonstrate how an online retailer can optimize buying paths to reduce bounce rate and improve conversion.

Clickstream Data

Clickstream data is an information trail a user leaves behind while visiting a website. It is typically captured in semi-structured website log files.

These website log files contain data elements such as a date and time stamp, the visitor’s IP address, the URLs of the pages visited, and a user ID that uniquely identifies the user.
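
To make these data elements concrete, here is a minimal Python sketch that parses one log line into the four fields listed above. The pipe-delimited layout, the timestamp format, the `ClickEvent` class, the `parse_line` function, and the sample values are all assumptions made for illustration; real web server logs vary, and this is not the format of any particular dataset.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class ClickEvent:
    """One page view from a website log (hypothetical structure)."""
    timestamp: datetime
    ip: str
    url: str
    user_id: str


def parse_line(line: str) -> ClickEvent:
    # Assumed layout: timestamp|ip|url|user_id (an illustration only).
    ts, ip, url, user_id = line.rstrip("\n").split("|")
    return ClickEvent(
        timestamp=datetime.strptime(ts, "%Y-%m-%d %H:%M:%S"),
        ip=ip,
        url=url,
        user_id=user_id,
    )


# Made-up sample line; the IP is from a range reserved for documentation.
sample = "2012-03-15 01:34:46|203.0.113.7|http://www.example.com/shoes|user-0001"
event = parse_line(sample)
print(event.user_id, event.url)
```

Once each line is a structured record, the events can be grouped by user and ordered by timestamp to reconstruct the path each visitor took through the site.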

Potential Uses of Clickstream Data

One of the original uses of Hadoop at Yahoo was to store and process their massive volume of clickstream data. Now enterprises of all types can use Hadoop and the Hortonworks Data Platform (HDP) to refine and analyze clickstream data. They can then answer business questions such as:

  • What is the most efficient path for a site visitor to research a product, and then buy it?
  • What products do visitors tend to buy together, and what are they most likely to buy in the future?
  • Where should I spend resources on fixing or enhancing the user experience on my website?

In this tutorial, we will focus on the “path optimization” use case. Specifically: how can we improve our website to reduce bounce rates and improve conversion?
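
As a rough illustration of the two metrics, the sketch below computes bounce rate and conversion rate from a list of page views. It assumes that page views have already been grouped into sessions, that a bounce is a session with exactly one page view, and that a conversion is a session that reaches a purchase-confirmation URL. These definitions, the `session_metrics` function, and the sample data are assumptions for illustration, not definitions taken from this tutorial.

```python
from collections import defaultdict


def session_metrics(events, purchase_url):
    """Return (bounce_rate, conversion_rate) for (session_id, url) pairs.

    Assumed definitions: a bounce is a session with a single page view;
    a conversion is a session that includes purchase_url.
    """
    pages = defaultdict(list)
    for session_id, url in events:
        pages[session_id].append(url)

    total = len(pages)
    if total == 0:
        return 0.0, 0.0

    bounces = sum(1 for urls in pages.values() if len(urls) == 1)
    conversions = sum(1 for urls in pages.values() if purchase_url in urls)
    return bounces / total, conversions / total


# Made-up sample data: four sessions, two bounces, one purchase.
events = [
    ("s1", "/home"),
    ("s2", "/home"), ("s2", "/product/42"), ("s2", "/checkout/complete"),
    ("s3", "/home"), ("s3", "/product/7"),
    ("s4", "/product/42"),
]
bounce_rate, conversion_rate = session_metrics(events, "/checkout/complete")
print(f"bounce rate: {bounce_rate:.0%}, conversion rate: {conversion_rate:.0%}")
```

Tracking both numbers before and after a site change is one simple way to judge whether the change moved visitors further along the buying path.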

Prerequisites

Outline

  • Visualize Log Data with Apache Zeppelin – Use Apache Zeppelin to analyze logs for customer demographics and their product interests.
  • Visualize Log Data with Microsoft Excel – Use Microsoft Excel to map user demographics and graph log data.
