    More efficient sum for some cases (#1251) · f8941a12
    Louis Feng authored
    * hacking to support dot of 3 by 2 inputs with gemm_batch.
    
    * clean up.
    
    * testing inplace reshape.
    
    * fixed a compile error.
    
    * added comments on todo.
    
    * check for output.
    
    * check for annotation.
    
    * more optimizations WIP.
    
    * sum simd.
    
    * moved parallel for
    
    * testing sum vectorization.
    
    * fixed merge errors.
    
    * sum wip.
    
    * more logic.
    
    * sum refactor and clean up.
    
    * clean up.
    
    * removed unrelated changes.
    
    * removed related changes from merge.
    
    * fixed clang compile errors.
Name
.ci/travis/ubuntu
cmake
contrib/docker
doc
licenses
maint
python
src
test
.clang-format
.gitignore
.gitmodules
.travis.yml
CMakeLists.txt
CONTRIB.md
INSTALL.md
LICENSE
README.md
VERSION.in
changes.md