Commit Graph

592 Commits (8113de942547d20f923b5af825eddcef99249d90)

Author SHA1 Message Date
fengjiayi 2f856769b9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_double_buffer_fix
7 years ago
fengjiayi 55e4b89f14 remove local_buffer_
7 years ago
fengjiayi 7bb18433fd refine code
7 years ago
Qiao Longfei 63cd5fb0b1
Merge pull request #9523 from jacquesqiao/fix-test_send_recv
7 years ago
Yancey1989 c3580eae46 Add prefetch interface on server side
7 years ago
Yu Yang 53fa7cb9cc Add local cache of double buffer reader
7 years ago
Tomasz Patejko b9874251c6 Plain LRN op throws an exception when is_test is set in backward pass
7 years ago
fengjiayi 95658767eb
Merge pull request #9428 from JiayiFeng/kernel_of_increment_op
7 years ago
typhoonzero 52439d9f1d Merge branch 'fix-test_send_recv' of https://github.com/jacquesqiao/Paddle into fix_server_shutdown
7 years ago
typhoonzero f6de248323 fix server shutdown
7 years ago
Yancey 374f1ca3b7 Fix dist error with lr decay layer (#9489)
7 years ago
Qiao Longfei f0af1398b8
add prefetch_op (#9495)
7 years ago
dzhwinter fbdb5b7b43 "fix based on comment"
7 years ago
Yi Wang c1c5e166d1 Fix cpplint errors
7 years ago
Yi Wang 64242c5d71 Rename test_serde into serde_test
7 years ago
Yang Yu af230d9bef Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
qiaolongfei e727cdb62d fix block num
7 years ago
dzhwinter a80bf702f3 Merge remote-tracking branch 'origin/develop' into speed/sequence_expand
7 years ago
fengjiayi 1a4b0d63e4
Merge pull request #9352 from JiayiFeng/doc_update_reader_doc
7 years ago
dzhwinter 8425c2c859
Speed/sequence op1 (#9217)
7 years ago
guosheng 5b8bb34470 Refine reshape_op by following comments.
7 years ago
fengjiayi 1e4f442a84 fix a compile error
7 years ago
武毅 d21ab2e2ba
Merge pull request #9448 from typhoonzero/fix_dist_slr_height
7 years ago
chengduo 24100e1fb8
Merge pull request #9449 from chengduoZH/feature/add_cos
7 years ago
JiayiFeng 52574733a6 Add KernelType switch for IncrementOp kernel
7 years ago
JiayiFeng 0ac43217ce check whether scalar condition var is on CPU before using
7 years ago
typhoonzero 450be963fe fix sparse errors
7 years ago
chengduoZH bdda08d9f2 add sin
7 years ago
JiayiFeng 01c5ca7364 fix bugs
7 years ago
JiayiFeng 917b205c1c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into kernel_of_increment_op
7 years ago
chengduoZH 2e577379ca add cos
7 years ago
dzhwinter 0412f5e09b "fix ci"
7 years ago
fengjiayi 802dcd676e remove CPU restrict in While_op
7 years ago
dzhwinter 0be1e09f2c "fix ci"
7 years ago
Yang Yu b0775588c0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
typhoonzero 9a9d67dac2 fix dist train selected rows height missing
7 years ago
fengjiayi 6dfc33c226 fix compile errors
7 years ago
fengjiayi e9370fe59f fix compile bugs
7 years ago
fengjiayi 0ce558f19e kernels of increment op
7 years ago
guosheng 4bfbc59122 Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into enhance-ReshapeOp
7 years ago
guosheng c078ed4608 Enhance reshape_op by adding Input(Shape)
7 years ago
yi.wu cc1c6afbbf fix slr serde
7 years ago
typhoonzero 094d509689 fix slr deser
7 years ago
typhoonzero 587781153e fix slr deser
7 years ago
qingqing01 25317bd312
Make the first device share data with the global scope in parallel_do_op. (#9398)
7 years ago
dzhwinter 5447046aee merge develop branch
7 years ago
dzhwinter db1b128feb "add details"
7 years ago
gongweibao e0b5691e41
Add drop_out_op unit test (#9364)
7 years ago
chengduoZH ab601c19c3 Add CUDAPinnedPlace
7 years ago
Tao Luo 1b67bc022c
Merge pull request #9329 from tpatejko/tpatejko/mkldnn-lrn
7 years ago
Abhinav Arora 65534c4762
Fluid channels should match the semantics of Go Channels (#9265)
7 years ago
Qiao Longfei f3dc3112cc
add split ids op (#9370)
7 years ago
Tao Luo c858f48979
Merge pull request #8887 from luotao1/infer_mkl
7 years ago
typhoonzero 1ab4fcb5e7 prepare pserver executor
7 years ago
Yu Yang 02aaecca35 Fix CPU compile
7 years ago
Xin Pan 3941c2ddec
Merge pull request #9355 from panyx0718/layer_norm
7 years ago
Luo Tao 6332bd1ed8 Merge branch 'develop' into infer_mkl
7 years ago
Yu Yang 50e7e25db3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
Qiao Longfei 4f522fa8d5
fix compile send_op on mac (#9360)
7 years ago
Yancey 1b0a17f415
Merge pull request #9303 from Yancey1989/split_send_op
7 years ago
Yancey1989 ebbb428db9 fix ci
7 years ago
Tao Luo cb3bbbd5c6
Merge pull request #9081 from kbinias/kbinias/mkldnn-activations
7 years ago
chengduo 4a92e89623
Merge pull request #9337 from chengduoZH/feature/fix_concat
7 years ago
武毅 12856c5f69
Merge pull request #9325 from dzhwinter/fix/dropout1
7 years ago
chengduoZH aca9180a76 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/fix_concat
7 years ago
Xin Pan 1a4be55a47 Pass cpu build
7 years ago
Xin Pan 904fa05f46 Improve layer_norm speed
7 years ago
Yancey1989 79af7cc9d3 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into split_send_op
7 years ago
Yancey1989 081b782434 update by comment
7 years ago
fengjiayi dd532e2086 refine MultiPassReader's doc string
7 years ago
gongweibao cffe1a9112
Profiler can get elapsed time of `sendop` (#9345)
7 years ago
Krzysztof Binias d8bd436fc1 Fixed tests
7 years ago
Krzysztof Binias a64b312e3a Correcting for PR comments
7 years ago
Krzysztof Binias 4466f0bec8 MKLDNN Relu Tanh Sqrt Abs activations added
7 years ago
chengduoZH 750aff10ce code refine
7 years ago
chengduoZH 043f47b27f fix concat op
7 years ago
yi.wu bb815d4364 update
7 years ago
yi.wu a9a228ad8d fix dist compile
7 years ago
Luo Tao ae820a34bc Merge branch 'develop' into infer_mkl
7 years ago
Varun Arora 76ae540f8e
Move Select to concurrency.py; incorporate outputs (#9136)
7 years ago
武毅 9c35b0dc1b
Merge pull request #9287 from typhoonzero/pserver_prepare_before_run
7 years ago
Tomasz Patejko 14ba67c0ef Function for running MKLDNN primitive added. Unittest added for is_test attribute
7 years ago
Tao Luo e027eb40d7
Merge pull request #9123 from tpatejko/tpatejko/mkldnn-lrn
7 years ago
dzhwinter e33af2414b "fast hack"
7 years ago
typhoonzero 9367f11eb7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into pserver_prepare_before_run
7 years ago
Yancey ee7f1ecd7c
Fix dist compile error (#9320)
7 years ago
Tao Luo 9126e626fc
Merge pull request #9165 from ROCmSoftwarePlatform/amd_cmake_01
7 years ago
guosheng b7e83d2467 Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into enhance-ReshapeOp
7 years ago
qingqing01 8f8728635a
Fix bug for backward tanspiler when using parallel_do operator. (#9282)
7 years ago
typhoonzero a88cc46221 update
7 years ago
typhoonzero 972a102b92 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into pserver_prepare_before_run
7 years ago
typhoonzero 5e6276edc1 fix transpiler bug
7 years ago
Yu Yang 9e3e424ecb
Merge pull request #9299 from reyoung/feature/refactor_batch_norm
7 years ago
Liu Yiqun 987a32dac3 Merge branch 'develop' into core_inference_fix_run
7 years ago
gongweibao 990d6396fe
Reuduce memory copy when communication between trainer and pserver. (#9271)
7 years ago
whs b594251f89
Merge pull request #9082 from wanghaoshuang/average_model
7 years ago
Kexin Zhao 64c5c8f8b0
Merge pull request #9269 from kexinzhao/softmax_cudnn_fp16
7 years ago
Kexin Zhao b9e6364e3c
Merge pull request #9267 from kexinzhao/new_relu_fp16
7 years ago
Kexin Zhao 4eaa789730 resolve conflict
7 years ago
Tomasz Patejko 72cc64e40e Device blobs are created only in training. Added testing attribute
7 years ago
dzhwinter 53c8c36a04 "debug the process"
7 years ago
tensor-tang 7260e3a443
Merge pull request #9214 from jczaja/prv-softmax-mkldnn-operator-PR
7 years ago
Yancey1989 2a4221ac07 split send op to send_vars and send_barrier
7 years ago
Yu Yang 0760aaf440 Shrink batch_norm_grad's inputs
7 years ago
guosheng 454b0a96be Remove the extra call of ValidateShape in ReshapeKernel
7 years ago
guosheng 437f7a3279 Resolve conflict according to the latest code
7 years ago
Jacek Czaja 3b95b55f07 - Softmax MKLDNN primitive integration
7 years ago
guosheng eb12cbe764 Refine reshape_op infershape
7 years ago
Liu Yiqun 0968753454 Enable the test of not creating variables every time.
7 years ago
typhoonzero 1eec926124 updates
7 years ago
typhoonzero e9d815e32b prepare and create op before run
7 years ago
Kexin Zhao ed2bc194c5
Merge pull request #9176 from kexinzhao/batch_norm_fp16
7 years ago
fengjiayi 809530f418 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_MultiEpochReader
7 years ago
fengjiayi 7c041e48f4
Merge pull request #9182 from JiayiFeng/dev_MultipleReader
7 years ago
typhoonzero 18461d0935 wip
7 years ago
wanghaoshuang edb4e29ab7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into average_model
7 years ago
Kexin Zhao b7801b9fcb small fix
7 years ago
Kexin Zhao 70e7122785 initial commit
7 years ago
Kexin Zhao d60180af39 inital commit
7 years ago
Kexin Zhao c1e9b1e37e
Merge pull request #9231 from kexinzhao/elementwise_add_fp16
7 years ago
dzhwinter e4c35d837d "add details"
7 years ago
fengjiayi 91b6d60003 Merge branch 'fix_bug_in_recordio' into dev_MultiEpochReader
7 years ago
fengjiayi 2532b922dc Add more unittests and fix bugs
7 years ago
dzhwinter 26822bd774 "add sequence kernel"
7 years ago
wanghaoshuang ad63722ed9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into average_model
7 years ago
fengjiayi f863866471 Add an unitest
7 years ago
武毅 5008020d19
Merge pull request #9154 from typhoonzero/pserver_parallel
7 years ago
fengjiayi 02b7d8bea5 Merge branch 'fix_bug_in_recordio' into dev_MultipleReader
7 years ago
typhoonzero 3666d7c02f fix num_blocks==2
7 years ago
sabreshao e50205e744 CMake refine for HIP support.
7 years ago
fengjiayi a2981f5c50 fix a bug
7 years ago
Yang yaming 381c6a026d
Merge pull request #9100 from pkuyym/fix-9049
7 years ago
Kexin Zhao d307b5e4a6 Merge remote-tracking branch 'upstream/develop' into elementwise_add_fp16
7 years ago
typhoonzero 139ae08fdf workable
7 years ago
Kexin Zhao 5271c32d24
Merge pull request #9223 from kexinzhao/dropout_fp16
7 years ago
Kexin Zhao 3da094fd7b rearrange test
7 years ago
fengjiayi 832deee448
Merge pull request #9178 from JiayiFeng/fix_bugs_in_reader
7 years ago
wanghaoshuang e01c770c05 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into average_model
7 years ago
wanghaoshuang d22f4de794 Refine sum_accumulates_op.
7 years ago
yangyaming 2c22552542 Fix some comments and adapt test_machine_translation.py.
7 years ago
fengjiayi 6f7e812bb3 fix bugs
7 years ago
dzhwinter 4ee1c9e60d "add sequence expand kernel"
7 years ago
yangyaming 2f2c5f5e60 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-9049
7 years ago
Kexin Zhao 4bf168b274 add fp16 kernel for elementwise add
7 years ago
Kexin Zhao d03dbb97f9 remove AttrType
7 years ago
Kexin Zhao 05ad15832a initial commit
7 years ago
Xi Chen 9eae086e39 add math_function to softmax's dep list
7 years ago
emailweixu b3f076a6e4
Merge pull request #9168 from emailweixu/fix_compile
7 years ago
yangyaming 869a6f9cea Add python wrapper.
7 years ago
fengjiayi d9868b0839 Add multi_pass_reader
7 years ago
Tomasz Patejko 2d95527527 Removing WITHIN_CHANNEL algorithm for lrn. CPU lrn operator works only with ACROSS_CHANNELS
7 years ago
Tomasz Patejko c51c446221 Content of GetExpectedKernelType moved to standalone function
7 years ago
Tomasz Patejko 192cc5dd32 Implementation of MKLDNN LRN
7 years ago
yangyaming 332b665fc7 Enhanced cpp implementation and unit test.
7 years ago
caoying03 a6e64242d8 follow comments.
7 years ago
Yu Yang 9cb8f50302 Complete fetch op
7 years ago
fengjiayi 07d38a9b9a refine patch
7 years ago
fengjiayi a571ef382e fix bugs
7 years ago
Kexin Zhao 446d54f5c3 update
7 years ago
Kexin Zhao ffa22a5f90 fix scaling param type
7 years ago
Tao Luo c0421379b7
Merge pull request #9043 from Xreki/core_inference_remove_clone
7 years ago
caoying03 c87d11a716 Merge branch 'develop' into enhance_reshape
7 years ago
Kexin Zhao e870947cfd fix batch norm fp16 param type
7 years ago
wanghaoshuang 92a01d4994 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into average_model
7 years ago
wanghaoshuang e0b136c0f9 Refine average accumulates op
7 years ago
typhoonzero 093e07d39e need to split var scopes
7 years ago
fengjiayi 3d677b1eca fix compile errors and make OpenFilesOpMaker derived from FileReaderMakerBase
7 years ago
fengjiayi 550622529c Add MultipleReader and open_files_op
7 years ago
Kexin Zhao 0a95a44b9a add python batch norm inference test
7 years ago
Kexin Zhao df99b16a16
Merge pull request #9167 from kexinzhao/pool2d_fp16
7 years ago
Kexin Zhao 39c676e208 initial commit
7 years ago
xuwei06 ab3543e35e Fix compilation for gcc5.4
7 years ago
Kexin Zhao 8ebfc153dd update
7 years ago
Kexin Zhao 3f5705c346
Merge pull request #9148 from kexinzhao/cast_op_fp16
7 years ago
Kexin Zhao bfbc25bdb8 add fp16 pool2d support
7 years ago
yangyaming 3b03e3748d Refine some ENFORCE.
7 years ago
yangyaming 58730ba131 Enhance unit test.
7 years ago
yangyaming bf3f56e899 Finish adaption for backward.
7 years ago
sabreshao 45c988d86a Demostration of cmake refine for HIP support.
7 years ago
Liu Yiqun 371c53f88c Add profiling event in feed, fetch and load op.
7 years ago
Yu Yang 5e87cd7574 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
Yu Yang 8b397d1602 Make recordio file reader thread-safe by default
7 years ago
Yu Yang 6f0dfd89a4 Single GPU ParallelExecutor complete
7 years ago
typhoonzero b8f4c8599e pserver runs in parallel
7 years ago
Kexin Zhao 8e7310146f
Merge pull request #9143 from kexinzhao/numpy_conv2d_pool2d_fp16
7 years ago
Kexin Zhao f3c5e81556 add fp16 for cast op
7 years ago
Tao Luo a448fbe9e1
Merge pull request #9134 from putcn/fix-selected-row-dep
7 years ago
Tao Luo 20be8e7e33
Merge pull request #9104 from ranqiu92/doc_dir
7 years ago
qingqing01 7c1a0b77a0
Delete the detection_output_op, which had been split into several operators. (#9121)
7 years ago
Kexin Zhao e967d19b0a add more tests
7 years ago
Kexin Zhao a13ec3432a fix test error
7 years ago
Kexin Zhao e4de5dc347 add conv2d fp16 support
7 years ago
Xi Chen d20c6eb6de add math_function to selected_rows_functor dependency list
7 years ago
qingqing01 1cd700d8e8
Fix bug in LRN operator. (#9124)
7 years ago
ranqiu 64775126f3 change the dir of docs
7 years ago
qingqing01 b5a16dca20
Fix a critical bug in softmax_with_cross_entropy_op backward. (#9120)
7 years ago
Yu Yang d84ddcf123 Stash
7 years ago
Thuan Nguyen 1e4c504e60 Implement Select OP (#9088)
7 years ago
Xin Pan d284cf88e5
Merge pull request #9037 from panyx0718/develop
7 years ago
dzhwinter 128adf53cb
[Speed]implement cudnn sequence softmax cudnn (#8978)
7 years ago
yangyaming 352fa41a16 Finish adapting forward.
7 years ago
wanghaoshuang d7e5e1f13d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into average_model
7 years ago
Kexin Zhao e26f1123da
Add fp16 mul op support and bind paddle fp16 to numpy fp16 (#9017)
7 years ago
dzhwinter 7140071152
"exported scatter to python" (#9038)
7 years ago
chengduo 11c43e5da3
Merge pull request #9072 from chengduoZH/feature/refine_parallel_do
7 years ago
wanghaoshuang 8a645685ce Add sum accumulator with window for model average
7 years ago
Xin Pan 4840c49b27 Better timeline
7 years ago
chengduoZH ef28e7deba refine parallel_do_grad
7 years ago
Yu Yang 48f213e5a1
Merge pull request #8991 from reyoung/feature/shuffle_reader
7 years ago
Cao Ying 881c5227ab
Merge pull request #8843 from zhouhanqing/Paddle-ReduceProd
7 years ago
武毅 d13ce35875 Feature/send recv can now retry (#9027)
7 years ago
dzhwinter 14fe40aaa6
Refine/nccl (#9009)
7 years ago
chengduo 788c600e9d
Merge pull request #8932 from chengduoZH/feature/add_concat_rows
7 years ago
chengduoZH 92e2207e18 refine doc
7 years ago
Yu Yang 164f2382af Polish code
7 years ago
chengduoZH ff09b21cd0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_concat_rows
7 years ago
Yu Yang e13aec601a
Merge pull request #8830 from reyoung/feature/recordio_file_reader
7 years ago
Yu Yang f9974a4a12 Make double_buffer reader async
7 years ago
Yu Yang a8c076e577 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/shuffle_reader
7 years ago
Luo Tao de13f0eb4e Merge branch 'develop' into infer_mkl
7 years ago
chengduoZH b9397b2668 remove concat_rows
7 years ago
QI JUN 7287630e83
Repair nccl op test (#8575)
7 years ago
caoying03 cf08185145 fix bugs and complete codes.
7 years ago
Yu Yang 225efa671f Remove dims in base class
7 years ago
QI JUN f7e9fe57d3
[Memory]More memory optimization policy (#8690)
7 years ago
Yu Yang 2ea4a5d96c Polish double buffer reader
7 years ago
kexinzhao 607eec30a8
Merge pull request #8946 from kexinzhao/fix_cuda_arch_fp16
7 years ago
Yancey b5ef315cf1
Fix dist compile error (#8987)
7 years ago
qingqing01 b3d26cd3ad
Fix bug in detection_output and mAP calculation in SSD. (#8985)
7 years ago
Yu Yang 46ae4075ee Polish ShuffleReader and test
7 years ago
chengduoZH f1c3ecb2b2 add concat rows
7 years ago
chengduo 685f03762e
Merge pull request #8890 from chengduoZH/feature/fix_bug_of_elementwise
7 years ago
Kexin Zhao 3b44b849d3 address comments
7 years ago
fengjiayi dd1244f3c9
Merge pull request #8943 from JiayiFeng/fix_bugs_in_readers
7 years ago
Yu Yang 7eedced82a Polish RecordIO
7 years ago
caoying03 a8cdd97ef5 Merge branch 'develop' into enhance_reshape
7 years ago
caoying03 1d4dfc0966 fix bugs.
7 years ago
Yu Yang cfca8a3a26 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/recordio_file_reader
7 years ago
Yu Yang fea43077f6 Refine
7 years ago
pzelazko-intel 4730a4be24 MKLDNN pool2d OP kernel added (#8879)
7 years ago
Kexin Zhao 95de7617eb fix bug
7 years ago
Kexin Zhao 1998d5afa2 add gpu info func to get compute cap
7 years ago
Kexin Zhao d400b4192d fix math function arch mismatch for older GPU
7 years ago
fengjiayi 614c33fb3a fix a potential bug in the c++ reader
7 years ago
chengduoZH 1509ce6638 enhancement look_up_table
7 years ago
fengjiayi aa3f5058d3
Merge pull request #8841 from JiayiFeng/dev_double_buffer_for_cpp_reader
7 years ago
QI JUN b341bac7e1
Refine cast op (#8923)
7 years ago
Yancey 8468037918
Fix sparse update memory error for distributed training (#8837)
7 years ago
fengjiayi 35e1e0d521 uses channel to replace the traditional buffer
7 years ago
fengjiayi b3a11fdf3a Merge branch 'rm_reader_HasNext' into dev_double_buffer_for_cpp_reader
7 years ago