Cloudera, Inc.

Cloudera Data Science Workbench

Cloudera Data Science Workbench (CDSW) enables fast, easy, and secure self-service data science for the enterprise. Data scientists can use existing tools, such as R, Python, and Scala, to run computations securely on data in Hadoop clusters. CDSW lets data scientists manage their own analytics pipelines, accelerating machine learning projects from exploration to production.

Features

  • Project Collaboration
  • Version Control
  • Secure access to data
  • Support for R, Scala and Python
  • Machine Learning and Analytics
  • Job Management
  • Machine Learning Model building
  • Machine Learning Model scoring
  • GPU support
  • Virtualisation and containers

Benefits

  • Fast, easy, and secure self-service data science for the enterprise
  • Data scientists can bring existing skills and tools
  • Securely run computations on data in Hadoop clusters
  • Interactive user sessions with Python, R, and Scala

Pricing

£3600 per instance per year

Service documents

G-Cloud 11

519949678972082

Cloudera, Inc.

James Underhill

07813563926

junderhill@cloudera.com

Service scope

Software add-on or extension Yes
What software services is the service an extension to Cloudera Data Science Workbench requires a Data Science and Engineering or Enterprise Data Hub subscription for each node on which it is installed.
Cloud deployment model
  • Public cloud
  • Private cloud
  • Hybrid cloud
Service constraints Cloudera Self Managed Cloud has restrictions on supported third-party components. These are described in the Cloudera Reference Architectures, which are maintained on our website.

Please refer to the service definition document for more information.
System requirements
  • Requires a supported OS and VM configuration
  • Requires Java 1.8+
  • Requires Spark 2.0+
  • Requires Cloudera EDH
  • Requires a wildcard DNS configuration

User support

Email or online ticketing support Yes, at extra cost
Support response times We provide 24/7 email and online support as part of our subscription
User can manage status and priority of support tickets Yes
Online ticketing support accessibility None or don’t know
Phone support Yes
Phone support availability 24 hours, 7 days a week
Web chat support No
Onsite support Yes, at extra cost
Support levels
  24x7 subscription
    Case priority   Initial response target    Update frequency target
    P1              Within 1 hour              Updated every 4 hours
    P2              Within 2 hours             Updated every business day
    P3              Within 8 hours             Updated every 3 business days
    P4              Within 24 hours            N/A, feature request
  8x5 subscription
    Case priority   Initial response target    Update frequency target
    P1              Within 1 business hour     Updated every 4 business hours
    P2              Within 2 business hours    Updated every business day
    P3              Within 8 business hours    Updated every 3 business days
    P4              Within 2 business days     N/A, feature request
Support available to third parties Yes

Onboarding and offboarding

Getting started On procurement of the service, a Cloudera Account Executive will contact the customer to arrange onboarding. This includes delivery of a license key to enable the enterprise features of the Cloudera software and onboarding of the Primary Support Contact(s) from the customer organization.

Under a defined process, an introduction email and license entitlement are provided to the nominated customer contact within 24 hours. A Proactive Customer Onboarding representative will set up a call within one week to arrange the details required to provide our support services.
Service documentation Yes
Documentation formats
  • HTML
  • PDF
End-of-contract data extraction Cloudera is built on Open Source Software and Open Standards. Once the contract ends, the storage services can still be accessed and all customer-owned data can be offloaded.
End-of-contract process At the end of the contract, the license key will expire and the customer will no longer be licensed to use the Enterprise features.

Using the service

Web browser interface Yes
Supported browsers
  • Internet Explorer 10
  • Internet Explorer 11
  • Microsoft Edge
  • Firefox
  • Chrome
  • Safari 9+
Application to install Yes
Compatible operating systems
  • Linux or Unix
  • Other
Designed for use on mobile devices No
Service interface No
API Yes
What users can and can't do using the API The API enables the scheduling of Jobs. The platform also exposes capabilities via a REST API for the development of microservices.
API documentation Yes
API documentation formats HTML
API sandbox or test environment No
Customisation available Yes
Description of customisation The service presents APIs for data models. These APIs can be configured by the end user.
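
As an illustration of the job-scheduling API mentioned above, the following Python sketch builds (but does not send) a request that starts an existing Job. It is a minimal sketch under stated assumptions: the endpoint path, the use of HTTP basic authentication with an API key as the user name, and the host, user, project, job ID, and key values are all hypothetical and do not come from this listing. Check the API documentation for the deployed release before relying on any of them.

```python
import base64
import json
import urllib.request


def build_job_start_request(base_url, username, project, job_id, api_key):
    """Build (but do not send) a POST request asking the service to start a job."""
    # Assumed endpoint layout; not taken from this listing.
    url = (
        f"{base_url.rstrip('/')}/api/v1/projects/"
        f"{username}/{project}/jobs/{job_id}/start"
    )
    # Assumed auth scheme: basic auth with the API key as user name
    # and an empty password.
    token = base64.b64encode(f"{api_key}:".encode("utf-8")).decode("ascii")
    return urllib.request.Request(
        url,
        data=json.dumps({}).encode("utf-8"),
        headers={
            "Authorization": f"Basic {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Hypothetical values, for illustration only.
request = build_job_start_request(
    "https://cdsw.example.com", "alice", "churn-model", 42, "hypothetical-api-key"
)
```

Passing the resulting object to urllib.request.urlopen would send it. Building the request separately keeps the sketch testable without a live deployment.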

Scaling

Independence of resources Resource quotas and containerisation are used to isolate users from each other.

Analytics

Service usage metrics Yes
Metrics types The platform can monitor CPU, Memory, GPU, Application Storage and session run statistics.
Reporting types
  • API access
  • Real-time dashboards

Resellers

Supplier type Not a reseller

Staff security

Staff security clearance Other security clearance
Government security clearance Up to Developed Vetting (DV)

Asset protection

Knowledge of data storage and processing locations Yes
Data storage and processing locations United Kingdom
User control over data storage and processing locations Yes
Datacentre security standards Supplier-defined controls
Penetration testing frequency Never
Protecting data at rest Encryption of all physical media
Data sanitisation process No
Equipment disposal approach A third-party destruction service

Data importing and exporting

Data export approach Users can copy data from the platform using a range of interfaces, including the API, CLI, and UI.
Data export formats
  • CSV
  • Other
Other data export formats
  • Avro
  • Parquet
  • Original Format
Data import formats
  • CSV
  • Other
Other data import formats
  • Avro
  • Parquet
  • Original Format

Data-in-transit protection

Data protection between buyer and supplier networks TLS (version 1.2 or above)
Data protection within supplier network TLS (version 1.2 or above)
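
A client connecting to the service can enforce the same TLS 1.2 minimum on its own side. The sketch below shows one way to do this with Python's standard ssl module. It illustrates client-side configuration only and is not a description of how the supplier configures its endpoints. The function name is hypothetical.

```python
import ssl


def make_tls12_context():
    """Return a client-side SSL context that refuses anything below TLS 1.2."""
    # create_default_context() enables certificate and hostname verification.
    context = ssl.create_default_context()
    # Reject TLS 1.0 and 1.1 handshakes; TLS 1.2 and above remain allowed.
    context.minimum_version = ssl.TLSVersion.TLSv1_2
    return context


context = make_tls12_context()
```

The context can then be passed to an HTTPS client, for example through the context argument of urllib.request.urlopen.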

Availability and resilience

Guaranteed availability Please refer to the service definition document.
Approach to resilience Please refer to the service definition document.
Outage reporting Please refer to the service definition document.

Identity and authentication

User authentication needed Yes
User authentication
  • Public key authentication (including by TLS client certificate)
  • Identity federation with existing provider (for example Google Apps)
Access restrictions in management interfaces and support channels The software uses Kerberos principals to authenticate users and administrators.

Cloudera uses an SSO framework for support. All customers need to be registered with Cloudera in order to access management interfaces and support channels.
Access restriction testing frequency At least once a year
Management access authentication
  • Public key authentication (including by TLS client certificate)
  • Identity federation with existing provider (for example Google Apps)
  • Dedicated link (for example VPN)
  • Username or password

Audit information for users

Access to user activity audit information Users have access to real-time audit information
How long user audit data is stored for User-defined
Access to supplier activity audit information Users contact the support team to get audit information
How long supplier audit data is stored for User-defined
How long system logs are stored for User-defined

Standards and certifications

ISO/IEC 27001 certification No
ISO 28000:2007 certification No
CSA STAR certification No
PCI certification No
Other security certifications No

Security governance

Named board-level person responsible for service security Yes
Security governance certified Yes
Security governance standards ISO/IEC 27001
Information security policies and processes Please refer to the service definition document.

Operational security

Configuration and change management standard Supplier-defined controls
Configuration and change management approach Please refer to the service definition document.
Vulnerability management type Supplier-defined controls
Vulnerability management approach Please refer to the service definition document
Protective monitoring type Supplier-defined controls
Protective monitoring approach Please refer to the service definition document
Incident management type Supplier-defined controls
Incident management approach Support response times 24 x 7 Support (Gold) Subscription

Secure development

Approach to secure software development best practice Conforms to a recognised standard, but self-assessed

Public sector networks

Connection to public sector networks No

Pricing

Price £3600 per instance per year
Discount for educational organisations No
Free trial available Yes
Description of free trial 30 days
Link to free trial https://www.cloudera.com/products/data-science-and-engineering/data-science-workbench.html

Service documents

  • Pricing document (ODS)
  • Service definition document (PDF)
  • Terms and conditions (PDF)
  • Modern Slavery statement (PDF)