Commit Graph

1186 Commits (44bb5dfbda3e8a44158d0ed9fa65475af69348c5)

Author SHA1 Message Date
Yi Wang b1a5a3cab8 Fix cpplint errors with float16* (#9751)
7 years ago
qingqing01 2b7e5bd366 Support testing during training by ParallelExecutor. (#9738)
7 years ago
fengjiayi 9b3c964341 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into modify_readers_to_fit_parallel_executor
7 years ago
Abhinav Arora 9a4ce6f1b6 Fix comparison warning in lod_reset_op.h (#9754)
7 years ago
Tao Luo ec89ed69ef Merge pull request #9655 from kbinias/kbinias/mkldnn-activations-dim2
7 years ago
Xin Pan 817df54b36 remove unused codes (#9750)
7 years ago
Yi Wang f31a0da363 Restore inference CMakeLists.txt
7 years ago
Yi Wang 25ad6884bb Merge branch 'develop' of http://github.com/paddlepaddle/paddle into cpplint-memory-detail
7 years ago
Yi Wang e831bd43b0 Add ARCHIVE_START/END back
7 years ago
typhoonzero baea2cf178 wip
7 years ago
Yancey1989 974b253e3a update by comment
7 years ago
Liu Yiqun 8a2667cd6e Merge branch 'develop' into core_inference_prepare
7 years ago
Liu Yiqun 90f3a421c7 Change the argument's type from reference to pointer.
7 years ago
Liu Yiqun 2d1d677e26 Merge branch 'develop' into core_inference_multi_thread
7 years ago
Yancey1989 9eaf445809 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into prefech_prog_on_server
7 years ago
Yancey1989 f132f51eb4 prepare prefetch context
7 years ago
Yi Wang 080e442671 Update
7 years ago
Yi Wang 0564e74fe5 Update
7 years ago
Yi Wang e9ba79c880 Update
7 years ago
Liu Yiqun a9e826ed49 Add the check of has_feed/fetch_operators back.
7 years ago
Liu Yiqun 7b40f7ce4a Merge branch 'develop' into core_inference_prepare
7 years ago
fengjiayi 6be51f10ec Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into modify_readers_to_fit_parallel_executor
7 years ago
Yi Wang e309bcd43a Update
7 years ago
Yi Wang 770cec4e31 Fix compilation errors
7 years ago
Yi Wang 45bc4538c4 Update paddle_memory in CMakeLists.txt files
7 years ago
fengjiayi 38ba7e5af4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update_reader
7 years ago
fengjiayi 49ab52d64d Modify MultipleReader
7 years ago
Yi Wang e98e2d36bd Update
7 years ago
JiayiFeng 03ff0e58fe fix compile errors
7 years ago
fengjiayi 8fed780f14 Complete threaded reader
7 years ago
Liu Yiqun 208fcf5225 Merge branch 'develop' into core_inference_multi_thread
7 years ago
Luo Tao 50e036a4ed fix compiler error on `tensor_py.h`
7 years ago
Yi Wang ccd83ef6b9 Update
7 years ago
typhoonzero 01c6618de9 first wip commit
7 years ago
Yi Wang 903403b2e1 Fix errors
7 years ago
Yi Wang a2730d1e0b Rename Metadata into MemoryBlock::Desc
7 years ago
Yi Wang eebb205324 Update CMakeLists
7 years ago
Yi Wang 67ba884d2a Update CMakeLists
7 years ago
Yi Wang 478055bd9f Update CMakeLists.txt
7 years ago
Yi Wang 535646cf25 Update (#9717)
7 years ago
Yi Wang 402a9f1f24 Rewrite the interface of memroy/detail
7 years ago
Yi Wang e185502ebe Fix cpplint errors with paddle/fluid/platform/dynload (#9715)
7 years ago
Yi Wang 544254fe4f Correct fluid/memory (#9716)
7 years ago
Yi Wang 0c43a376e2 Fix cpplint errors with paddle/fluid/platform/gpu_info.* (#9710)
7 years ago
Yi Wang 55ffceaadb Fix cpplint errors paddle/fluid/platform/place.* (#9711)
7 years ago
Yi Wang 809962625f Fix cpplint errors of enforce.* (#9706)
7 years ago
Yi Wang ef4ee22668 Fix cpplint errors with paddle/fluid/platform/cpu_info* (#9708)
7 years ago
Yi Wang 5bb7d59e3a Fix cpplint errors with paddle/fluid/inference (#9702)
7 years ago
Yi Wang 1543c4cf6a Fix cpplint errors of paddle/fluid/pybind and add some tests (#9694)
7 years ago
Xin Pan b138d29c38 Avoid init_p2p all the times
7 years ago
Lei Wang bcb46f5549 Merge pull request #9688 from wangkuiyi/cpplint-recordio
7 years ago
Kexin Zhao b2a1c9e8b7 Add float16 support to non-cudnn softmax op on GPU (#9686)
7 years ago
Yi Wang c839ec6c47 Update rcordio
7 years ago
Yi Wang 797a7184ac Unify Fluid code to Google C++ style (#9685)
7 years ago
Yi Wang 9f3ac225ad Unify Fluid code to Google C++ style
7 years ago
Kexin Zhao d00bd9eb72 Update the cuda API and enable tensor core for GEMM (#9622)
7 years ago
lgone2000 517f619501 fix pybind.cc compile error (#9681)
7 years ago
Abhinav Arora b5b7ea12fa Fix CPPLint issues in tuple.h (#9670)
7 years ago
chengduo 3d3e5aeb9c Merge pull request #9648 from chengduoZH/feature/expose_CUDAPinnedPlace_to_python
7 years ago
chengduoZH 4ff237f93c follow comments
7 years ago
Lei Wang 09b4a1a361 Build: generate all the build related files into one directory. (#9512)
7 years ago
Yi Wang 30061d4831 Fix cpplint errors in paddle/fluid/string (#9667)
7 years ago
Qiao Longfei 424dd2fc43 Merge pull request #9597 from jacquesqiao/sgd-support-update-selected-rows
7 years ago
Abhinav Arora 7bf82f82b1 Fix CPPlint errors in channel.h, channel_impl.h and channel_test.cc (#9628)
7 years ago
Kexin Zhao d904b3dd1d Merge pull request #9623 from kexinzhao/enable_cudnn_tensor_core
7 years ago
qiaolongfei ff4208e676 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into sgd-support-update-selected-rows
7 years ago
qiaolongfei e66bd4cb73 add GetDataTypeOfVar
7 years ago
chengduoZH 17842e33c3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/expose_CUDAPinnedPlace_to_python
7 years ago
fengjiayi 442c150333 a draft of ThreadedReader
7 years ago
Xin Pan 043c23080c Merge pull request #9637 from panyx0718/feed
7 years ago
mozga-intel f8dd03dced Prepare code for CentOS (#9651)
7 years ago
chengduoZH 8e4e155c52 add PyCUDAPinnedTensorSetFromArray
7 years ago
Krzysztof Binias 76550d87b7 Reformat code
7 years ago
Krzysztof Binias c4107748d0 Add support for dim equals 2 in activation functions
7 years ago
Tao Luo 5eb9cecc4a Merge pull request #9607 from luotao1/remove_var
7 years ago
Liu Yiqun fecbf7ff73 Merge branch 'develop' into core_inference_fix_run
7 years ago
Yancey1989 46989663b1 prefetch selected rows
7 years ago
qingqing01 a84a580e65 Add CUDA kernel for prior_box_op. (#9553)
7 years ago
chengduoZH 6af178356b expose CUDAPinnedPlace to Python
7 years ago
Yancey1989 abfd9fe798 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into prefech_prog_on_server
7 years ago
Liu Yiqun 3a82578278 Merge branch 'develop' into core_inference_prepare
7 years ago
Wu Yi d139f2ca4e Merge pull request #9595 from typhoonzero/fix_test_sendrecv_portbind
7 years ago
Yancey1989 1842758117 prefetch prog run on new scope
7 years ago
Liu Yiqun 27f553b377 Add the check of CPU results and GPU results in multi-thread unittest.
7 years ago
fengjiayi af24290123 Add 'buffer_size' api for open_files op
7 years ago
fengjiayi 8441347808 fix lint
7 years ago
fengjiayi a122dcccd5 fix lint
7 years ago
fengjiayi 3cd9e450ac fix lint
7 years ago
Xin Pan 4bbfa9eccb Add feed to ParallelExecutor
7 years ago
fengjiayi f2c0b88649 Merge pull request #9550 from JiayiFeng/make_MultipleReader_thread-safe
7 years ago
Yu Yang bc8f436061 Merge pull request #9634 from reyoung/feature/fix_leaf_ops
7 years ago
Liu Yiqun 9cba062252 Add inferface to change the feed/fetch_holder_name.
7 years ago
chengduo c14305f0f6 Merge pull request #9380 from chengduoZH/feature/add_CUDAPinnedPlace
7 years ago
Yu Yang 27533b6423 Fix Leaf Ops in Graph
7 years ago
Yi Wang 68a7534461 Merge branch 'develop' into fix_cpplint_errors_operators_detail
7 years ago
typhoonzero b03fa889b7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_test_sendrecv_portbind
7 years ago
Tao Luo a98a3fdc46 Merge pull request #9385 from mozga-intel/mozga/mkldnn-fc
7 years ago
Liu Yiqun fbd3604cad Split Executor.Run to Executor.Prepare and Executor.RunPreparedContext for inference.
7 years ago
Wu Yi fb8c1cf07c Merge pull request #9377 from typhoonzero/prepare_pserver_executor
7 years ago
Kexin Zhao 9ba36604d8 fix cpplint error
7 years ago
Kexin Zhao 187ba08789 enable tensor core for conv cudnn
7 years ago
Yi Wang 66e0aed700 Fix compilation warnings of fix_split_ids_op.h (#9619)
7 years ago
Darcy 5f29cc76c5 Merge pull request #9618 from wangkuiyi/fix_variable_response
7 years ago
Yi Wang e941914a33 Fix compilation warnings in variable_response.cc
7 years ago
Yi Wang 97eac501b1 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into fix_cpplint_errors_operators_detail
7 years ago
Yi Wang ebcf5fb9d4 Fix compare warning
7 years ago
Yi Wang 54316bdd7e Update
7 years ago
Yi Wang 937797849e Fix serde_test.cc compile error
7 years ago
chengduoZH 51c22fe434 follow comments
7 years ago
chengduoZH e099b18045 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_CUDAPinnedPlace
7 years ago
Luo Tao 09b53c086d add remove_var from c++ end
7 years ago
chengduoZH 766c74055a follow comments
7 years ago
mozga-intel 46e14bbcbb Enforce: 2 and 4 dims, remove information about out in format
7 years ago
mozga-intel 32f8ac7d3b Remove additional message
7 years ago
mozga-intel 34a8084328 Added new fc files, register fc kernel
7 years ago
mozga-intel 2811ea4440 Implementation of MKLDNN FC
7 years ago
qiaolongfei 31e8d807d9 optimize code
7 years ago
typhoonzero 1f6e0448bc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into prepare_pserver_executor
7 years ago
typhoonzero b853ac8c8b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_test_sendrecv_portbind
7 years ago
Liu Yiqun e359b24a85 Merge branch 'develop' into core_inference_multi_thread
7 years ago
typhoonzero 44c29abdbf remove comments
7 years ago
typhoonzero 00f8e63b8d update
7 years ago
Xin Pan b123ce88a1 Add enable/disable for delayed ops
7 years ago
Xin Pan be1373dcf9 Polish
7 years ago
Xin Pan 46f3a39e91 polish and add comments.
7 years ago
Xin Pan d0ac92531d Improve ParallelExecutor performance
7 years ago
typhoonzero 94eea16e6d fix sendrecv port bind
7 years ago
Yancey1989 0cafe39010 run prefetch prog on server
7 years ago
fengjiayi 44d5f42a7e update reader
7 years ago
chengduo dd75fbde81 Merge pull request #9560 from chengduoZH/feature/fix_parallel_exe
7 years ago
qiaolongfei e64dda7e44 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into sgd-support-update-selected-rows
7 years ago
Liu Yiqun 8137f58c06 Merge branch 'develop' into core_inference_fix_run
7 years ago
Yi Wang 736950603e Resolve conflicts
7 years ago
Qiao Longfei 37d9a72e9e Merge pull request #9575 from jacquesqiao/lookup_table_support_SelectedRows_as_parameter
7 years ago
qiaolongfei 2669aea67f sgd_op support optimize SelectedRows
7 years ago
Liu Yiqun 2a2e22e35f Merge branch 'develop' into core_inference_fix_run
7 years ago
typhoonzero 92818baa57 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into threadpool_for_io
7 years ago
typhoonzero b851c0739f update compile
7 years ago
chengduoZH 494bee5135 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/fix_parallel_exe
7 years ago
qiaolongfei 13ecb5e50a Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into lookup_table_support_SelectedRows_as_parameter
7 years ago
Helin Wang c4720376c6 disable ncclAllReduceOp as well
7 years ago
Helin Wang 9fbe90ef96 fix according to comments
7 years ago
Helin Wang 9365d110b5 temporaryly disable ncclBcastOp test, it fails randomly
7 years ago
chengduoZH 2514d70ea7 follow comments
7 years ago
typhoonzero f43be75b82 multi stream thread pool
7 years ago
chengduoZH 30adc0b5f8 add notation
7 years ago
fengjiayi 899827f28c Merge pull request #9535 from reyoung/feature/fix_double_buffer
7 years ago
qiaolongfei 7a6ffb6280 add TestLookupTableWIsSelectedRows
7 years ago
fengjiayi b94f24d44f Move StartPrefetcher and EndPrefetcher to private
7 years ago
typhoonzero 75bfdb3a3c refine
7 years ago
qiaolongfei a94e25740e optimize code
7 years ago
qiaolongfei 6fff0d4d4c update LookupTableGradKernel
7 years ago
Qiao Longfei 3fd9266278 Merge pull request #9564 from jacquesqiao/fix-compile
7 years ago
qiaolongfei 3b3d210c3e lookuptable support SelectedRows as table parameter
7 years ago
qiaolongfei 9a101cfc08 clean code
7 years ago
dzhwinter 62ba8723ac Merge remote-tracking branch 'origin/develop' into speed/sequence_expand
7 years ago
qiaolongfei 997e9a1fd2 fix mac compile
7 years ago
Yancey 09fcf5f272 Merge pull request #9555 from jacquesqiao/improve-prefetch-on-server
7 years ago
Guo Sheng d908c3b2fa Merge pull request #9008 from lcy-seso/enhance_reshape
7 years ago
chengduoZH de5e56bee8 add og has been broadcasted
7 years ago
qiaolongfei 04a5c03785 add todo
7 years ago
Qiao Longfei e2bb405252 Merge branch 'develop' into improve-prefetch-on-server
7 years ago
typhoonzero fc4bcdd753 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into prepare_pserver_executor
7 years ago
Yancey1989 eb04ccbf80 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into prefetch_on_server
7 years ago
Yancey1989 606c57da23 update by comment
7 years ago
qiaolongfei 9af9effc93 optimize code
7 years ago
qiaolongfei 453630692e fix prefetch hang problem, add some more logs
7 years ago
fengjiayi 2945a98eb3 Make MultipleReader thread-safe
7 years ago
Yi Wang 9e2581847d Merge branch 'develop' of http://github.com/paddlepaddle/paddle into fix_cpplint_errors_operators_detail
7 years ago
Yi Wang 01667392ad Rename test_serde into serde_test (#9504)
7 years ago
Abhinav Arora ffcc760478 Fix deadlock in channel_test (#9544)
7 years ago
JiayiFeng 597c845c99 fix typo
7 years ago
JiayiFeng c0257f0a5b Add comments
7 years ago
JiayiFeng 5aa440fd7a Add move constructor for Item
7 years ago
Yi Wang 1bbbc4e76f Merge branch 'develop' of http://github.com/paddlepaddle/paddle into fix_cpplint_errors_operators_detail
7 years ago
qiaolongfei fdecae5fc5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into prefetch_on_server
7 years ago
qiaolongfei 3a5bce775e try to complete
7 years ago
Yi Wang 767f453ab8 Add cpplint pre-commit hook (#9511)
7 years ago
JiayiFeng a469666e42 fix compile errors
7 years ago
fengjiayi 2f856769b9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_double_buffer_fix
7 years ago
fengjiayi 55e4b89f14 remove local_buffer_
7 years ago
fengjiayi 7bb18433fd refine code
7 years ago
Qiao Longfei 63cd5fb0b1 Merge pull request #9523 from jacquesqiao/fix-test_send_recv
7 years ago
Tao Luo 527e6585d1 Merge pull request #9528 from tpatejko/tpatejko/mkldnn-lrn-fix-is_test_failure
7 years ago
Yancey1989 c3580eae46 Add prefetch interface on server side
7 years ago
Yu Yang 53fa7cb9cc Add local cache of double buffer reader
7 years ago
Tao Luo 7102eb2efd Merge pull request #9531 from luotao1/fix_profiler_test
7 years ago
chengduoZH ffa63974b9 compare the performance of unpinned memory and pinned memory
7 years ago
Tomasz Patejko b9874251c6 Plain LRN op throws an exception when is_test is set in backward pass
7 years ago
Luo Tao 5baa529e0e fix compiler error of profiler_test in ONLY_CPU mode
7 years ago
fengjiayi 95658767eb Merge pull request #9428 from JiayiFeng/kernel_of_increment_op
7 years ago
typhoonzero 52439d9f1d Merge branch 'fix-test_send_recv' of https://github.com/jacquesqiao/Paddle into fix_server_shutdown
7 years ago
typhoonzero f6de248323 fix server shutdown
7 years ago
chengduo 81d93514d6 Merge pull request #9522 from chengduoZH/feature/refine_parallel_exe
7 years ago
Qiao Longfei 23bab34ca3 Fix data transform when inplace (#9450)
7 years ago
chengduoZH 60d0a0594e refine parallel
7 years ago
Yancey 374f1ca3b7 Fix dist error with lr decay layer (#9489)
7 years ago
Qiao Longfei f0af1398b8 add prefetch_op (#9495)
7 years ago
Yu Yang fa21436d0d Merge pull request #9080 from reyoung/cpp_parallel_executor
7 years ago
dzhwinter fbdb5b7b43 "fix based on comment"
7 years ago
Yi Wang c1c5e166d1 Fix cpplint errors
7 years ago
Yi Wang 64242c5d71 Rename test_serde into serde_test
7 years ago
Abhinav Arora 5f9da86ba5 Fix the order of reads and write from buffered channel (#9423)
7 years ago
Yang Yu af230d9bef Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
qiaolongfei e727cdb62d fix block num
7 years ago
dzhwinter a80bf702f3 Merge remote-tracking branch 'origin/develop' into speed/sequence_expand
7 years ago
fengjiayi 1a4b0d63e4 Merge pull request #9352 from JiayiFeng/doc_update_reader_doc
7 years ago
dzhwinter 8425c2c859 Speed/sequence op1 (#9217)
7 years ago
guosheng 5b8bb34470 Refine reshape_op by following comments.
7 years ago
fengjiayi 1e4f442a84 fix a compile error
7 years ago
武毅 d21ab2e2ba Merge pull request #9448 from typhoonzero/fix_dist_slr_height
7 years ago
chengduo 24100e1fb8 Merge pull request #9449 from chengduoZH/feature/add_cos
7 years ago
fengjiayi 869ef01d66 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into kernel_of_increment_op
7 years ago
JiayiFeng 52574733a6 Add KernelType switch for IncrementOp kernel
7 years ago
typhoonzero 96192a85ec Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_dist_slr_height
7 years ago
Abhinav Arora f5da16e51b Disabling channel test to debug issue (#9491)
7 years ago
JiayiFeng 0ac43217ce check whether scalar condition var is on CPU before using
7 years ago
typhoonzero 450be963fe fix sparse errors
7 years ago
chengduoZH bdda08d9f2 add sin
7 years ago
JiayiFeng 01c5ca7364 fix bugs
7 years ago
Yu Yang e868950e5f Add comments
7 years ago
JiayiFeng 917b205c1c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into kernel_of_increment_op
7 years ago
chengduoZH 2e577379ca add cos
7 years ago
Tao Luo 857a8997de Merge pull request #9384 from luotao1/removeVar
7 years ago
dzhwinter 0412f5e09b "fix ci"
7 years ago
Yu Yang 38b53b37b4 Remove Pop method
7 years ago
Yu Yang ce2f096372 Merge branch 'cpp_parallel_executor' of github.com:reyoung/Paddle into cpp_parallel_executor
7 years ago
Yu Yang 7da1ea07a2 Use PopAll
7 years ago
fengjiayi 802dcd676e remove CPU restrict in While_op
7 years ago
dzhwinter 0be1e09f2c "fix ci"
7 years ago
Yang Yu b0775588c0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
typhoonzero 9a9d67dac2 fix dist train selected rows height missing
7 years ago
Yu Yang 084cdd1f4f Rename code
7 years ago
chengduoZH 58a9f9f781 set the max size of cudapinned memory
7 years ago
fengjiayi 6dfc33c226 fix compile errors
7 years ago
fengjiayi e9370fe59f fix compile bugs
7 years ago
fengjiayi 0ce558f19e kernels of increment op
7 years ago
guosheng 4bfbc59122 Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into enhance-ReshapeOp
7 years ago
guosheng c078ed4608 Enhance reshape_op by adding Input(Shape)
7 years ago
yi.wu cc1c6afbbf fix slr serde
7 years ago
typhoonzero 094d509689 fix slr deser
7 years ago
typhoonzero 587781153e fix slr deser
7 years ago
Luo Tao 7f4012247e adjust remove rule for variables
7 years ago
Yu Yang 201f79d039 Use Extend method
7 years ago
Yu Yang dcf7bd2d92 Add initP2P
7 years ago
Yu Yang 50f71f5005 Using blocking queue
7 years ago
qingqing01 25317bd312 Make the first device share data with the global scope in parallel_do_op. (#9398)
7 years ago
Yu Yang 7dcb217e31 Refine allreduce op
7 years ago
Yu Yang c0c2e15920 NCCL AllReduce
7 years ago
Yu Yang 3f88fad08c Fix merge op
7 years ago
Yu Yang 5b92dd4026 Remove dev sync
7 years ago
Yu Yang 52dd8ff09a Force sync dev
7 years ago
Yu Yang dfb8680018 Early drop fetch op
7 years ago
Yu Yang 9af870854e Use heap variables
7 years ago
Yu Yang 222763296f Change fetch op
7 years ago
Yu Yang 76570c2e96 Wait fetch op
7 years ago
Yu Yang b6ca3711b4 Get error
7 years ago
Yu Yang 55e2cc3d87 FetchOp Force sync
7 years ago
Yu Yang 5a02739ce9 Throw error
7 years ago
Yu Yang f385228f05 Add Paddle Enforce
7 years ago
Yu Yang 833e522d16 Enhance drop kids
7 years ago
Yu Yang aba46f077b Disable P2P
7 years ago
dzhwinter 5447046aee merge develop branch
7 years ago
dzhwinter db1b128feb "add details"
7 years ago
gongweibao e0b5691e41 Add drop_out_op unit test (#9364)
7 years ago
chengduoZH ab601c19c3 Add CUDAPinnedPlace
7 years ago
Tao Luo 1b67bc022c Merge pull request #9329 from tpatejko/tpatejko/mkldnn-lrn
7 years ago
Abhinav Arora 65534c4762 Fluid channels should match the semantics of Go Channels (#9265)
7 years ago
chengduoZH 158d6c4d19 add unit test
7 years ago
Luo Tao ccfec1bcb1 remove vars when remove ops
7 years ago
chengduoZH 18eb77303d add CUDAPinnedPlace
7 years ago
Qiao Longfei f3dc3112cc add split ids op (#9370)
7 years ago
chengduo 2e4a398638 Merge pull request #9216 from chengduoZH/feature/add_pinned_memory
7 years ago
Tao Luo c858f48979 Merge pull request #8887 from luotao1/infer_mkl
7 years ago
typhoonzero 1ab4fcb5e7 prepare pserver executor
7 years ago
chengduoZH 9e99446e25 Add note for cudaMallocHost
7 years ago
chengduoZH a0e2cf03e4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_pinned_memory
7 years ago
Yu Yang 9dd64d83f3 WMT Model
7 years ago
chengduoZH 39004080f4 replace use_pinned with is_pinned
7 years ago
Yu Yang cb40c33137 Update unittest
7 years ago
Yu Yang ee97687f69 Fix compile
7 years ago
Yu Yang 3aa2a8ffcf Follow comments
7 years ago
Yu Yang 02aaecca35 Fix CPU compile
7 years ago
Yu Yang 54bd17fe7b Complete Flowers
7 years ago
Xin Pan 3941c2ddec Merge pull request #9355 from panyx0718/layer_norm
7 years ago
Luo Tao 6332bd1ed8 Merge branch 'develop' into infer_mkl
7 years ago
Yu Yang 50e7e25db3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
Yu Yang 5c7a523326 Add Graphviz output
7 years ago
Qiao Longfei 4f522fa8d5 fix compile send_op on mac (#9360)
7 years ago
Yancey 1b0a17f415 Merge pull request #9303 from Yancey1989/split_send_op
7 years ago
Yancey1989 ebbb428db9 fix ci
7 years ago
Tao Luo cb3bbbd5c6 Merge pull request #9081 from kbinias/kbinias/mkldnn-activations
7 years ago
Qiao Longfei 8ccc61f334 support empty tensor (#9338)
7 years ago
chengduo 4a92e89623 Merge pull request #9337 from chengduoZH/feature/fix_concat
7 years ago
武毅 12856c5f69 Merge pull request #9325 from dzhwinter/fix/dropout1
7 years ago
chengduoZH aca9180a76 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/fix_concat
7 years ago
Xin Pan 1a4be55a47 Pass cpu build
7 years ago
Xin Pan 904fa05f46 Improve layer_norm speed
7 years ago
Yancey1989 79af7cc9d3 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into split_send_op
7 years ago
Yancey1989 081b782434 update by comment
7 years ago
fengjiayi dd532e2086 refine MultiPassReader's doc string
7 years ago
gongweibao cffe1a9112 Profiler can get elapsed time of `sendop` (#9345)
7 years ago
Darcy 8090eb6272 added proto_desc to device_tracer's dep list (#9342)
7 years ago
Yu Yang edfd741e3a Add simple python wrapper for ParallelExecutor
7 years ago
Yu Yang a7b0d5bd26 Clean code
7 years ago
Yu Yang e3144393e3 Extract Executors to indie modules
7 years ago
Yu Yang c70b60dd70 Make executor steal graph inside
7 years ago
Yu Yang 4c3361cda8 Extract GraphExecutor
7 years ago
Yu Yang b123e43bf9 extract multi devices graph builder
7 years ago
Krzysztof Binias d8bd436fc1 Fixed tests
7 years ago
Krzysztof Binias a64b312e3a Correcting for PR comments
7 years ago
Krzysztof Binias 4466f0bec8 MKLDNN Relu Tanh Sqrt Abs activations added
7 years ago
chengduoZH 750aff10ce code refine
7 years ago
chengduoZH 043f47b27f fix concat op
7 years ago
yi.wu bb815d4364 update
7 years ago
yi.wu a9a228ad8d fix dist compile
7 years ago
Luo Tao ae820a34bc Merge branch 'develop' into infer_mkl
7 years ago
Varun Arora 76ae540f8e Move Select to concurrency.py; incorporate outputs (#9136)
7 years ago
武毅 9c35b0dc1b Merge pull request #9287 from typhoonzero/pserver_prepare_before_run
7 years ago
Tomasz Patejko 14ba67c0ef Function for running MKLDNN primitive added. Unittest added for is_test attribute
7 years ago
Tao Luo e027eb40d7 Merge pull request #9123 from tpatejko/tpatejko/mkldnn-lrn
7 years ago
dzhwinter e33af2414b "fast hack"
7 years ago
typhoonzero 9367f11eb7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into pserver_prepare_before_run
7 years ago
Yancey ee7f1ecd7c Fix dist compile error (#9320)
7 years ago
Tao Luo 9126e626fc Merge pull request #9165 from ROCmSoftwarePlatform/amd_cmake_01
7 years ago
guosheng b7e83d2467 Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into enhance-ReshapeOp
7 years ago
qingqing01 8f8728635a Fix bug for backward tanspiler when using parallel_do operator. (#9282)
7 years ago
typhoonzero a88cc46221 update
7 years ago
typhoonzero 972a102b92 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into pserver_prepare_before_run
7 years ago
typhoonzero 5e6276edc1 fix transpiler bug
7 years ago
dzhwinter 13f1050ab0 "fix mixed_vector bug" (#9319)
7 years ago
Liu Yiqun 5419da6e7a Fix bug caused by block_id.
7 years ago
Yu Yang 9e3e424ecb Merge pull request #9299 from reyoung/feature/refactor_batch_norm
7 years ago
Liu Yiqun 987a32dac3 Merge branch 'develop' into core_inference_fix_run
7 years ago
Yu Yang dd73d18bb7 Extract SSAGraph
7 years ago
sabreshao e0ac6bc436 CMake refine for HIP support.
7 years ago
gongweibao 990d6396fe Reuduce memory copy when communication between trainer and pserver. (#9271)
7 years ago
whs b594251f89 Merge pull request #9082 from wanghaoshuang/average_model
7 years ago
Yu Yang 1d8fe2a220 Enhance device context pool (#9293)
7 years ago
Kexin Zhao 64c5c8f8b0 Merge pull request #9269 from kexinzhao/softmax_cudnn_fp16
7 years ago
Kexin Zhao b9e6364e3c Merge pull request #9267 from kexinzhao/new_relu_fp16
7 years ago
Kexin Zhao 4eaa789730 resolve conflict
7 years ago
Tomasz Patejko 72cc64e40e Device blobs are created only in training. Added testing attribute
7 years ago
dzhwinter 53c8c36a04 "debug the process"
7 years ago
tensor-tang 7260e3a443 Merge pull request #9214 from jczaja/prv-softmax-mkldnn-operator-PR
7 years ago
Yu Yang 79989c9025 Add SSA builder
7 years ago
Yancey1989 2a4221ac07 split send op to send_vars and send_barrier
7 years ago
Yu Yang 0760aaf440 Shrink batch_norm_grad's inputs
7 years ago
guosheng 454b0a96be Remove the extra call of ValidateShape in ReshapeKernel
7 years ago
guosheng 437f7a3279 Resolve conflict according to the latest code
7 years ago