Cloudera, Inc.

Cloudera Data Science Workbench

Cloudera Data Science Workbench (CDSW) enables fast, easy, and secure self-service data science for the enterprise. Data scientists can bring existing tools, such as R, Python, and Scala, to securely run on data. CDSW lets data scientists manage their own analytics pipelines, accelerating machine learning projects from exploration to production.

Features

  • Project Collaboration
  • Version Control
  • Secure access to data
  • Support for R, Scala and Python
  • Machine Learning and Analytics
  • Job Management
  • Machine Learning Model building
  • Machine Learning Model scoring
  • GPU support
  • Virtualisation and containers

Benefits

  • Fast, easy, and secure self-service data science for the enterprise
  • Data scientists can bring existing skills and tools
  • Securely run computations on data in Hadoop clusters
  • Interactive user sessions with Python, R, and Scala

Pricing

£3600 per instance per year

Service documents

Framework

G-Cloud 11

Service ID

5 1 9 9 4 9 6 7 8 9 7 2 0 8 2

Contact

Cloudera, Inc.

James Underhill

07813563926

junderhill@cloudera.com

Service scope

Software add-on or extension
Yes
What software services is the service an extension to
Cloudera Data Science Workbench requires a Data Science and Engineering, or Enterprise Data Hub subscription for each node that it is installed on.
Cloud deployment model
  • Public cloud
  • Private cloud
  • Hybrid cloud
Service constraints
Cloudera Self Managed Cloud has restrictions on supported 3rd party components. This is described in Cloudera Reference Architectures which are maintained on our website.

Please refer to the service definition document for more information.
System requirements
  • Requires a supported OS and VM configuration
  • Requires Java 1.8+
  • Requires Spark 2.0+
  • Requires Cloudera EDH
  • Requires a wildcard DNS configuration

User support

Email or online ticketing support
Yes, at extra cost
Support response times
We provide 24/7 email and online support as part of our subscription
User can manage status and priority of support tickets
Yes
Online ticketing support accessibility
None or don’t know
Phone support
Yes
Phone support availability
24 hours, 7 days a week
Web chat support
No
Onsite support
Yes, at extra cost
Support levels
CASE INITIAL RESPONSE UPDATE FREQUENCY TARGET PRIORITY TARGET 24x7SUBSCRIPTION 24x7 SUBSCRIPTION P1 Within 1 hour Updated every 4 hours P2 Within 2 hours Updated every business day P3 Within 8 hours Updated every 3 business days P4 Within 24 hours N/A, feature request CASE INITIAL RESPONSE TARGET UPDATE FREQUENCY TARGET PRIORITY 8x5 SUBSCRIPTION 8x5 SUBSCRIPTION P1 Within 1 business hour Updated every 4 business hours P2 Within 2 business hours Updated every business day P3 Within 8 business hours Updated every 3 business days P4 Within 2 business days N/A, feature request
Support available to third parties
Yes

Onboarding and offboarding

Getting started
On procurement of the service, a Cloudera Account Executive will contact the customer to arrange onboarding. This will include delivery of a license key to enable the enterprise features of the Cloudera software and onboarding of Primary

Support Contact(s) from the customer organization.

A defined process exists whereby within 24 hours, an introduction email and license entitlement is provided to thenominated customer contact. A Proactive Customer Onboarding representative will set up a call within one week to arrange the details required to provide our support services.
Service documentation
Yes
Documentation formats
  • HTML
  • PDF
End-of-contract data extraction
Cloudera is built on Open Source Software and Open Standards. Once the contract ends the storage services can still be accessed and all customer owned data can be off-loaded.
End-of-contract process
At the end of the contract the license key will expire and the customer will no longer be licensed to use the Enterprise features.

Using the service

Web browser interface
Yes
Supported browsers
  • Internet Explorer 10
  • Internet Explorer 11
  • Microsoft Edge
  • Firefox
  • Chrome
  • Safari 9+
Application to install
Yes
Compatible operating systems
  • Linux or Unix
  • Other
Designed for use on mobile devices
No
Service interface
No
API
Yes
What users can and can't do using the API
The API enables the scheduling of Jobs. The platform also presents capabilities via a REST API for the development of micro services.
API documentation
Yes
API documentation formats
HTML
API sandbox or test environment
No
Customisation available
Yes
Description of customisation
The service presents APIs for data models. These APIs can be configured by the end user.

Scaling

Independence of resources
Resource quotas and containerisation are used to isolate users from each other.

Analytics

Service usage metrics
Yes
Metrics types
The platform can monitor CPU, Memory, GPU, Application Storage and session run statistics.
Reporting types
  • API access
  • Real-time dashboards

Resellers

Supplier type
Not a reseller

Staff security

Staff security clearance
Other security clearance
Government security clearance
Up to Developed Vetting (DV)

Asset protection

Knowledge of data storage and processing locations
Yes
Data storage and processing locations
United Kingdom
User control over data storage and processing locations
Yes
Datacentre security standards
Supplier-defined controls
Penetration testing frequency
Never
Protecting data at rest
Encryption of all physical media
Data sanitisation process
No
Equipment disposal approach
A third-party destruction service

Data importing and exporting

Data export approach
Users can copy the data from the platform using a range of software including API, CLI and UI.
Data export formats
  • CSV
  • Other
Other data export formats
  • Avro
  • Parquet
  • Original Format
Data import formats
  • CSV
  • Other
Other data import formats
  • Avro
  • Parquet
  • Original Format

Data-in-transit protection

Data protection between buyer and supplier networks
TLS (version 1.2 or above)
Data protection within supplier network
TLS (version 1.2 or above)

Availability and resilience

Guaranteed availability
Please refer to the service definition document.
Approach to resilience
Please refer to the service definition document.
Outage reporting
Please refer to the service definition document.

Identity and authentication

User authentication needed
Yes
User authentication
  • Public key authentication (including by TLS client certificate)
  • Identity federation with existing provider (for example Google Apps)
Access restrictions in management interfaces and support channels
The software uses Kerberos Principles to authenticate users and administrators.

Cloudera uses a SSO framework for support. All customers need to be registered with Cloudera in order to access management interfaces and support channels.
Access restriction testing frequency
At least once a year
Management access authentication
  • Public key authentication (including by TLS client certificate)
  • Identity federation with existing provider (for example Google Apps)
  • Dedicated link (for example VPN)
  • Username or password

Audit information for users

Access to user activity audit information
Users have access to real-time audit information
How long user audit data is stored for
User-defined
Access to supplier activity audit information
Users contact the support team to get audit information
How long supplier audit data is stored for
User-defined
How long system logs are stored for
User-defined

Standards and certifications

ISO/IEC 27001 certification
No
ISO 28000:2007 certification
No
CSA STAR certification
No
PCI certification
No
Other security certifications
No

Security governance

Named board-level person responsible for service security
Yes
Security governance certified
Yes
Security governance standards
ISO/IEC 27001
Information security policies and processes
Please refer to the service definition document.

Operational security

Configuration and change management standard
Supplier-defined controls
Configuration and change management approach
Please refer to the service definition document.
Vulnerability management type
Supplier-defined controls
Vulnerability management approach
Please refer to the service definition document
Protective monitoring type
Supplier-defined controls
Protective monitoring approach
Please refer to the service definition document
Incident management type
Supplier-defined controls
Incident management approach
Support response times 24 x 7 Support (Gold) Subscription

Secure development

Approach to secure software development best practice
Conforms to a recognised standard, but self-assessed

Public sector networks

Connection to public sector networks
No

Pricing

Price
£3600 per instance per year
Discount for educational organisations
No
Free trial available
Yes
Description of free trial
30 days
Link to free trial
https://www.cloudera.com/products/data-science-and-engineering/data-science-workbench.html

Service documents

Return to top ↑