Hello.
I am a beginner with parallelizing compilers.
I am looking for a compiler or utility that automatically detects parallelizable code segments and compiles them for parallel execution.
For example, when the compiler finds a simple “for” loop,
I imagine it could determine whether the loop would run efficiently on an OpenCL device. It could then automatically generate either OpenCL kernel code or ordinary CPU code. (Building this on LLVM might make it somewhat easier.)
However, I found that typical CUDA and OpenCL compilers require the parallel kernel code to be marked explicitly.
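To make the question concrete, here is a minimal sketch in C of what I mean. The names vec_add_serial and vec_add_kernel_src are only illustrative, not taken from any real project. The first function is the kind of plain loop I would like a compiler to offload by itself; the string holds the same loop body rewritten by hand as an OpenCL C kernel, which is the explicit marking I would like to avoid.

```c
#include <stddef.h>

/* Plain serial loop. Every iteration is independent of the others,
   so in principle it could be distributed across many work-items. */
void vec_add_serial(const float *a, const float *b, float *out, size_t n)
{
    for (size_t i = 0; i < n; ++i) {
        out[i] = a[i] + b[i];
    }
}

/* The same loop body written by hand as an OpenCL C kernel.
   It is kept only as a source string for comparison: nothing in this
   sketch builds or launches it, and the host-side setup (context,
   queue, buffers) that real OpenCL code needs is omitted. */
const char *const vec_add_kernel_src =
    "__kernel void vec_add(__global const float *a,\n"
    "                      __global const float *b,\n"
    "                      __global float *out)\n"
    "{\n"
    "    size_t i = get_global_id(0);\n"
    "    out[i] = a[i] + b[i];\n"
    "}\n";
```

In the kernel version the loop itself disappears and the index comes from get_global_id(0), so the programmer has to decide in advance that this code is a kernel. What I am asking about is a tool that would start from vec_add_serial and make that decision automatically.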
Do you know of any compilers or other resources for this kind of automatic parallelization?
Thanks in advance.