Obtenez par e-mail toute l'actualité Hortonworks

Une fois par mois, recevez les dernières idées, tendances, informations d’analyse et découvertes sur le Big Data.


Sign up for the Developers Newsletter

Une fois par mois, recevez les dernières idées, tendances, informations d’analyse et découvertes sur le Big Data.




Prêt à débuter ?

Télécharger Sandbox

Que pouvons-nous faire pour vous ?

* Je comprends que je peux me désabonner à tout moment. J'ai également compris les informations supplémentaires fournies dans la Politique de confidentialité de Hortonworks.
fermerBouton Fermer

Hortonworks Data Steward Studio

Comprenez, sécurisez et contrôlez les données dans les data lakes de l'entreprise.

Taking a modern approach to managing your data

Téléchargez le livre blanc


Data Steward Studio (DSS) is a DataPlane Service that empowers users to understand, secure, and govern data across enterprise data lakes. DSS empowers enterprises to precisely identify and evaluate the integrity of their data in order to securely collaborate and confidently democratize it across the enterprise.

DSS enables enterprises to contextualize knowledge about the data located across hybrid data lakes which empowers them to generate actionable insights and take meaningful actions about their business operations.

video imgbouton de la vidéo

Data Steward Studio


Discover and classify data across data lakes

DSS features out-of-the-box profilers that can run as a pipeline of operations on data located across multiple data lakes. Customers can install the profiler agent in a data lake and set up a specific schedule to generate various types of data profiles. DSS empowers data stewards to:

  • Understand enterprise data based on sensitivity and distribution characteristics
  • Get visibility into the number of tables that have been added every day
  • Receive operational metrics including the number of partitions, time of creation, table size, number of rows, input and output format
Blog: Understand your hybrid data lakes to exploit their business value!
Blog: Forrester Recognizes Hortonworks as a Strong Performer in Big Data Fabric Wave
Découvrez les sources de données des data lakes
Comprenez les données de l'entreprise

DSS provides all the metadata associated with a particular data asset tracked by Apache Atlas. With DSS, data stewards are able to:

  • Get end-to-end visibility into data provenance, origin, lineage 
and impact
  • Understand how data is created and modified
  • Visualize upstream lineage and downstream impact
  • Discern how schema or data has evolved over time
Webinar: Path to GDPR Compliance Begins with Data Governance – Live Panel
Comprenez les données de l'entreprise
Comply with regulations

DSS displays all the audit events associated with a particular data asset through Apache Ranger. With DSS, internal and external auditors are empowered to:

  • Get visibility into who has accessed which data from a forensic audit or compliance perspective
  • Visualize access patterns, identify anomalies and ensure proper control mechanisms
  • View the most recent raw audit events, as well as summarized views of audits by type of access and access outcome
White Paper: Path to GDPR Compliance Begins with Data Governance
Respectez les réglementations
Proposez des données fiables pour l'activité

DSS enables data consumers and stewards to create Asset Collections to group heterogenous data assets based on business definition. Asset Collections can be created based on categories such as customer profiles, sales assets, financials, PII, and HR data. By creating Asset Collections, data stewards and data consumers can:

  • Automatisez les stratégies d'utilisation, de conservation et de restauration des données.
  • Organize data into asset collections based on business classifications, purpose, protections and relevance
  • Search data in the data lake using tags, attribute facets, or free text
  • Get an overview of data assets within an asset collection through intuitive dashboards
Press Release: Data Steward Studio Helps Enterprises Across Cloud and On-Prem Data Lakes
Proposez des données fiables pour l'activité