PDI Production Options

Question asked by Ben Pusey on Mar 19, 2018


Hi everyone, I'm new to PDI and I'm a little confused about deployment. I've got transformations and jobs running locally that do what I want, but I need them to run on a schedule on a remote server.

I've read around this topic and only confused myself. Do I need a repo? Does it have to have a database? Is this the same as Carte? Should I just get an Ubuntu VM with remote desktop and do it that way?

Apologies if I'm being slow, but I can't seem to find a single source of info on the simplest way to go from running jobs on my desktop to deploying them to production.


My use case is fairly simple: it's basic ETL from OLTP databases to an AWS Redshift data warehouse. I'm the only one who will be setting up and running jobs, and none of the jobs are particularly intensive.


If anyone can point me in the direction of the simplest method of getting up and running I'd be very grateful. Doubly so if there's a way to do it using AWS EC2.


Many thanks.