
John Snow Labs NLP on Databricks

30-day free trial with no limit on the amount of processed data. Provide a Databricks access token to install the Healthcare NLP and Spark OCR libraries on your Databricks instance, on a cluster of your choice.
Get full access to:
Spark NLP
State-of-the-art natural language processing for Python, Java, or Scala
Healthcare NLP
State-of-the-art clinical & biomedical natural language processing
Spark OCR
Scalable, private, and highly accurate OCR and de-identification library
20+ ready-to-use
Jupyter notebooks for the most common tasks
Then Pay Only for What You Use
  • After the trial you need to switch to a paid subscription
  • You will be charged $4.95/DBU
  • Invoiced once/month directly by John Snow Labs

Spark NLP on Databricks FAQ


John Snow Labs NLP package includes:

  • Spark NLP library,
  • Spark NLP for Healthcare library,
  • Spark OCR library,
  • access to all pre-trained models and pipelines published on the NLP Models Hub,
  • access to 20+ Jupyter notebooks for the most common NLP tasks,
  • premium support,
  • all updates to the software & models that are released during the subscription period.

Using the above form, you can apply for a free trial. At the end of the trial, if you want to continue using the software, you provide us with billing information and we’ll set up a commercial subscription for you.

The software will stop processing documents – for both training and inference. If you choose to buy a license, we will provide you with new credentials that will reactivate it. Otherwise, you must uninstall the software. In any case, data you have already processed is yours to keep.

No. Once you have a valid subscription, you can use John Snow Labs NLP on any number of clusters and jobs, and share it with as many users as you want within your account.

The Spark NLP library is free, forever, unlimited, for personal and commercial use. Spark NLP is released under an Apache 2.0 open-source license – including the pre-trained models and documentation.

Running John Snow Labs NLP on Databricks

Python and Scala.

We officially support AWS, Azure and GCP Databricks.

The configuration we recommend for AWS is r4.2xlarge (autoscaling, min workers: 1, max workers: 4).
The configuration we recommend for Azure is Standard_DS13_v2 (autoscaling, min workers: 1, max workers: 4).
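
As an illustration, the recommended settings above could be expressed as a cluster specification. The sketch below is not an official John Snow Labs or Databricks script: the field names (cluster_name, spark_version, node_type_id, autoscale) follow the Databricks Clusters API create request, while the helper name, the default cluster name, and the runtime placeholder are assumptions made for this example. No node type is recommended on this page for GCP, so the helper rejects it instead of guessing.

```python
import json

# Node types recommended on this page, keyed by cloud provider.
RECOMMENDED_NODE_TYPES = {
    "aws": "r4.2xlarge",
    "azure": "Standard_DS13_v2",
}


def recommended_cluster_spec(cloud, spark_version, cluster_name="jsl-nlp"):
    """Build a cluster spec dict using the recommended node type and autoscaling.

    `spark_version` must be supplied by the caller; this page does not
    state which Databricks runtime version to use.
    """
    key = cloud.lower()
    if key not in RECOMMENDED_NODE_TYPES:
        raise ValueError(f"no recommended node type listed for cloud: {cloud!r}")
    return {
        "cluster_name": cluster_name,  # hypothetical name, choose your own
        "spark_version": spark_version,
        "node_type_id": RECOMMENDED_NODE_TYPES[key],
        "autoscale": {"min_workers": 1, "max_workers": 4},
    }


if __name__ == "__main__":
    # "<your-runtime-version>" is a placeholder, not a real runtime identifier.
    spec = recommended_cluster_spec("aws", "<your-runtime-version>")
    print(json.dumps(spec, indent=2))
```

The resulting dictionary can be reviewed, adjusted, and then submitted through whichever Databricks tooling you already use to create clusters.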


The John Snow Labs NLP package is offered at $4.95/DBU.

No. The John Snow Labs subscription includes only access to the NLP libraries. Databricks subscription costs are invoiced and paid separately.

No. You only pay for the compute resources you use. Spin up your cluster, use it according to your needs and pause it when you are done. You will be charged according to the consumed DBUs.
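
To make the usage-based pricing concrete, here is a minimal sketch that estimates the John Snow Labs charge from a DBU total at the $4.95/DBU rate quoted on this page. The function name and the rounding to whole cents are assumptions made for this example; the actual invoice is based on the DBU usage reports from Databricks, and Databricks' own costs are not included.

```python
from decimal import Decimal, ROUND_HALF_UP

# Rate quoted on this page, in USD per DBU.
JSL_RATE_PER_DBU = Decimal("4.95")


def estimate_jsl_charge(consumed_dbus):
    """Return the estimated John Snow Labs charge in USD for a DBU total.

    Rounding to cents (half up) is an assumption of this sketch, not a
    documented billing rule.
    """
    dbus = Decimal(str(consumed_dbus))
    if dbus < 0:
        raise ValueError("consumed_dbus must be non-negative")
    return (dbus * JSL_RATE_PER_DBU).quantize(
        Decimal("0.01"), rounding=ROUND_HALF_UP
    )


if __name__ == "__main__":
    # Hypothetical month with 100 DBUs consumed.
    print(estimate_jsl_charge(100))  # prints 495.00
```

A paused cluster consumes no DBUs, so a month with no usage yields an estimate of 0.00.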

We invoice on the 1st day of each month for the previous month, based on DBU usage reports received from Databricks.

Online payments via credit cards.

All our payments are handled via Stripe, a PCI Service Provider Level 1, which is the highest grade of payment processing security. You can rest assured that your payment information is safe and secure.

Yes! Please email us to describe your situation and needs.


No. You install and run the software on your Databricks infrastructure. The software does not “call home” and no data or results are sent to John Snow Labs.

You do. We will never even see them.

This is not a SaaS solution – instead, you run the software on your infrastructure and are fully responsible for protecting it. The libraries do not send anything to John Snow Labs or to another third party.



Yes. John Snow Labs NLP is designed to enable you to train & tune your own models for most tasks.

Yes. All John Snow Labs NLP subscriptions (trials included) come with a custom deployment script that you can attach to any new cluster and run for frictionless installation and configuration.

The full list is available here. Expect the list to keep growing over time.


Email support@johnsnowlabs.com, call us at +1-302-786-5227, or start a chat on spark-nlp.slack.com.

Same-business-day 8x5 support is included with all subscriptions. We can also provide 24x7 support for production systems – please email us if you require it.

Yes. When subscribing to John Snow Labs NLP on Databricks, you will get 20+ ready-to-use Python notebooks that will help you speed up your project and solve the most common NLP tasks.