When I wrote my feed-forward routine, I had to come up with a way of sharing intermediate results between all cores. The method I used worked, but I thought I could do better and make it a little more programmer-friendly at the same time.
Here is a write-up of the alternative methods I tested and my (somewhat surprising) results: