The latest build of SA 16 is 2127. Are you running this build?
Do you mean that queries are fast initially because they are executed by multiple cores/processors in parallel, and then become slow because they are executed by a single core/processor?
Have you already compared the query plans, with statistics, from before and after the performance issue occurs?