Prompt Engineering vs Finetuning

When to use one vs the other?

Have access to a special dataset (e.g. user or customer data) and want better performance
Improved latency (10x or more)
Improved cost
Trying to achieve a particular tone
Local/private model

In general, start with prompt engineering and push that as far as you can. Only then, look to finetuning for further optimisation.