by joseluisquiroga » Wed Jul 05, 2017 3:40 pm
Thanks for the post.
I do not know why I had in mind a 2x ratio between write and read speed for on-chip transactions. Maybe I got that idea from the image on page 26 of the Epiphany Arch Ref Manual. While looking for a passage to quote from the manual, I found that page 24 says it is 16x.
Maybe you know where the 16x comes from?
It seems a lot, but for my purposes I think I will stick to local writes. My big-O complexity function is still the same.
How do you keep code small enough to run it from local mem? That seems to be the way. Doing local reads of remotely written data means your code will look like networking code.
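To make the "networking code" point concrete, here is a rough sketch of what I mean, written as plain C that simulates two cores on a host. It does not use the Epiphany SDK at all: the names core_mem_t, try_send and try_receive are made up for illustration, and on the real chip you would need to check the write-ordering guarantees in the manual before relying on the flag being written last. The idea is that each core only polls flags in its own local memory and only writes into the other core's memory, so no remote reads are needed.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define MAILBOX_CAPACITY 32

/* Hypothetical layout of the part of one core's local memory used for messaging. */
typedef struct {
    volatile uint32_t ready;   /* set by the peer when a message has arrived */
    volatile uint32_t credit;  /* set by the peer when it can accept a message */
    uint32_t length;
    uint8_t payload[MAILBOX_CAPACITY];
} core_mem_t;

void core_mem_init(core_mem_t *mem)
{
    memset(mem, 0, sizeof *mem);
    mem->credit = 1;           /* each core may send one message initially */
}

/* Sender: reads only its own memory, writes into the peer's memory. */
int try_send(core_mem_t *self, core_mem_t *peer, const void *data, uint32_t len)
{
    assert(len <= MAILBOX_CAPACITY);
    if (!self->credit)
        return 0;                       /* local read: peer inbox still busy */
    self->credit = 0;                   /* local write */
    memcpy(peer->payload, data, len);   /* remote writes */
    peer->length = len;
    peer->ready = 1;                    /* publish the message last */
    return 1;
}

/* Receiver: polls its own memory, then hands the credit back with a remote write. */
int try_receive(core_mem_t *self, core_mem_t *peer, void *out, uint32_t *len)
{
    if (!self->ready)
        return 0;                       /* local read: nothing arrived yet */
    *len = self->length;
    memcpy(out, self->payload, *len);   /* local reads */
    self->ready = 0;                    /* local write */
    peer->credit = 1;                   /* remote write: sender may send again */
    return 1;
}
```

The credit flag plays the role of an acknowledgement, so the sender never has to read the receiver's memory to find out whether the inbox is free.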
Running it from off-chip mem seems a lot worse than this 16x because you lose all the benefits of parallelism. Off-chip mem is a bottleneck (not to mention the nondeterministic behavior). So any program that relies, at run time, on off-chip data or off-chip code will have this issue.
Cheers.