fengjiayi
2f856769b9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_double_buffer_fix
7 years ago
fengjiayi
55e4b89f14
remove local_buffer_
7 years ago
fengjiayi
7bb18433fd
refine code
7 years ago
Qiao Longfei
63cd5fb0b1
Merge pull request #9523 from jacquesqiao/fix-test_send_recv
...
fix send_recv_op_test
7 years ago
Yancey1989
c3580eae46
Add prefetch interface on server side
7 years ago
Yu Yang
53fa7cb9cc
Add local cache of double buffer reader
7 years ago
Tomasz Patejko
b9874251c6
Plain LRN op throws an exception when is_test is set in backward pass
7 years ago
fengjiayi
95658767eb
Merge pull request #9428 from JiayiFeng/kernel_of_increment_op
...
kernels of IncrementOp
7 years ago
typhoonzero
52439d9f1d
Merge branch 'fix-test_send_recv' of https://github.com/jacquesqiao/Paddle into fix_server_shutdown
7 years ago
typhoonzero
f6de248323
fix server shutdown
7 years ago
Yancey
374f1ca3b7
Fix dist error with lr decay layer ( #9489 )
...
Fix dist error with lr decay layer
7 years ago
Qiao Longfei
f0af1398b8
add prefetch_op ( #9495 )
...
* add prefetch_op
* fix ci
* optimize code
* optimize code
* fix include
7 years ago
dzhwinter
fbdb5b7b43
"fix based on comment"
7 years ago
Yi Wang
c1c5e166d1
Fix cpplint errors
7 years ago
Yi Wang
64242c5d71
Rename test_serde into serde_test
7 years ago
Yang Yu
af230d9bef
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
qiaolongfei
e727cdb62d
fix block num
7 years ago
dzhwinter
a80bf702f3
Merge remote-tracking branch 'origin/develop' into speed/sequence_expand
7 years ago
fengjiayi
1a4b0d63e4
Merge pull request #9352 from JiayiFeng/doc_update_reader_doc
...
Update c++ readers doc
7 years ago
dzhwinter
8425c2c859
Speed/sequence op1 ( #9217 )
...
* "add functors"
* "remove old code"
* "fix"
* "fix ci"
* "add details"
* "fix ci"
* "fix ci"
* "fix ci"
* "fix ci"
* "remove unused code"
7 years ago
guosheng
5b8bb34470
Refine reshape_op by following comments.
7 years ago
fengjiayi
1e4f442a84
fix a compile error
7 years ago
武毅
d21ab2e2ba
Merge pull request #9448 from typhoonzero/fix_dist_slr_height
...
fix dist train selected rows height missing
7 years ago
chengduo
24100e1fb8
Merge pull request #9449 from chengduoZH/feature/add_cos
...
Add cos and sin
7 years ago
JiayiFeng
52574733a6
Add KernelType switch for IncrementOp kernel
7 years ago
JiayiFeng
0ac43217ce
check whether scalar condition var is on CPU before using
7 years ago
typhoonzero
450be963fe
fix sparse errors
7 years ago
chengduoZH
bdda08d9f2
add sin
7 years ago
JiayiFeng
01c5ca7364
fix bugs
7 years ago
JiayiFeng
917b205c1c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into kernel_of_increment_op
7 years ago
chengduoZH
2e577379ca
add cos
7 years ago
dzhwinter
0412f5e09b
"fix ci"
7 years ago
fengjiayi
802dcd676e
remove CPU restrict in While_op
7 years ago
dzhwinter
0be1e09f2c
"fix ci"
7 years ago
Yang Yu
b0775588c0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
typhoonzero
9a9d67dac2
fix dist train selected rows height missing
7 years ago
fengjiayi
6dfc33c226
fix compile errors
7 years ago
fengjiayi
e9370fe59f
fix compile bugs
7 years ago
fengjiayi
0ce558f19e
kernels of increment op
7 years ago
guosheng
4bfbc59122
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into enhance-ReshapeOp
7 years ago
guosheng
c078ed4608
Enhance reshape_op by adding Input(Shape)
7 years ago
yi.wu
cc1c6afbbf
fix slr serde
7 years ago
typhoonzero
094d509689
fix slr deser
7 years ago
typhoonzero
587781153e
fix slr deser
7 years ago
qingqing01
25317bd312
Make the first device share data with the global scope in parallel_do_op. ( #9398 )
7 years ago
dzhwinter
5447046aee
merge develop branch
7 years ago
dzhwinter
db1b128feb
"add details"
7 years ago
gongweibao
e0b5691e41
Add drop_out_op unit test ( #9364 )
7 years ago
chengduoZH
ab601c19c3
Add CUDAPinnedPlace
7 years ago
Tao Luo
1b67bc022c
Merge pull request #9329 from tpatejko/tpatejko/mkldnn-lrn
...
Improvements for MKLDNN LRN
7 years ago
Abhinav Arora
65534c4762
Fluid channels should match the semantics of Go Channels ( #9265 )
...
* Fluid Channel should match Go Channel in Semantics
* Fix Python channel_send
* Address code rveiew feedback
* Fix open_files_op.cc
* Add description to Channel Asserts
7 years ago
Qiao Longfei
f3dc3112cc
add split ids op ( #9370 )
...
* add split_ids_op
* add TestSplitIdsOp
* fix comment
* add test for empty tensor
* clean code
* rm unused code
7 years ago
Tao Luo
c858f48979
Merge pull request #8887 from luotao1/infer_mkl
...
add MKL for fluid static and shared library
7 years ago
typhoonzero
1ab4fcb5e7
prepare pserver executor
7 years ago
Yu Yang
02aaecca35
Fix CPU compile
7 years ago
Xin Pan
3941c2ddec
Merge pull request #9355 from panyx0718/layer_norm
...
Improve layer_norm speed
7 years ago
Luo Tao
6332bd1ed8
Merge branch 'develop' into infer_mkl
7 years ago
Yu Yang
50e7e25db3
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
Qiao Longfei
4f522fa8d5
fix compile send_op on mac ( #9360 )
7 years ago
Yancey
1b0a17f415
Merge pull request #9303 from Yancey1989/split_send_op
...
Split send op to send_vars and send_barrier
7 years ago
Yancey1989
ebbb428db9
fix ci
7 years ago
Tao Luo
cb3bbbd5c6
Merge pull request #9081 from kbinias/kbinias/mkldnn-activations
...
MKLDNN Relu Tanh Sqrt Abs activations added
7 years ago
chengduo
4a92e89623
Merge pull request #9337 from chengduoZH/feature/fix_concat
...
Fix concat_op
7 years ago
武毅
12856c5f69
Merge pull request #9325 from dzhwinter/fix/dropout1
...
"fast hack"
7 years ago
chengduoZH
aca9180a76
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/fix_concat
7 years ago
Xin Pan
1a4be55a47
Pass cpu build
7 years ago
Xin Pan
904fa05f46
Improve layer_norm speed
...
transfomer on a single device step time
reduces from 0.157 to 0.125
7 years ago
Yancey1989
79af7cc9d3
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into split_send_op
7 years ago
Yancey1989
081b782434
update by comment
7 years ago
fengjiayi
dd532e2086
refine MultiPassReader's doc string
7 years ago
gongweibao
cffe1a9112
Profiler can get elapsed time of `sendop` ( #9345 )
7 years ago
Krzysztof Binias
d8bd436fc1
Fixed tests
7 years ago
Krzysztof Binias
a64b312e3a
Correcting for PR comments
7 years ago
Krzysztof Binias
4466f0bec8
MKLDNN Relu Tanh Sqrt Abs activations added
7 years ago
chengduoZH
750aff10ce
code refine
7 years ago
chengduoZH
043f47b27f
fix concat op
7 years ago
yi.wu
bb815d4364
update
7 years ago
yi.wu
a9a228ad8d
fix dist compile
7 years ago
Luo Tao
ae820a34bc
Merge branch 'develop' into infer_mkl
7 years ago
Varun Arora
76ae540f8e
Move Select to concurrency.py; incorporate outputs ( #9136 )
...
* Move Select to concurrency.py; incorporate outputs
* CLang formatting for concurrency
* Remove extra bracket - formatting fix - 3
* Comment fix
7 years ago
武毅
9c35b0dc1b
Merge pull request #9287 from typhoonzero/pserver_prepare_before_run
...
Pserver prepare before run
7 years ago
Tomasz Patejko
14ba67c0ef
Function for running MKLDNN primitive added. Unittest added for is_test attribute
7 years ago
Tao Luo
e027eb40d7
Merge pull request #9123 from tpatejko/tpatejko/mkldnn-lrn
...
Implementation of MKLDNN LRN
7 years ago
dzhwinter
e33af2414b
"fast hack"
7 years ago
typhoonzero
9367f11eb7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into pserver_prepare_before_run
7 years ago
Yancey
ee7f1ecd7c
Fix dist compile error ( #9320 )
7 years ago
Tao Luo
9126e626fc
Merge pull request #9165 from ROCmSoftwarePlatform/amd_cmake_01
...
Demostration of cmake refine for HIP support.
7 years ago
guosheng
b7e83d2467
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into enhance-ReshapeOp
7 years ago
qingqing01
8f8728635a
Fix bug for backward tanspiler when using parallel_do operator. ( #9282 )
...
* Temporarily fix bug for backward tanspiler when using parallel_do operator.
* Fix bug for backward tanspiler when using parallel_do operator
7 years ago
typhoonzero
a88cc46221
update
7 years ago
typhoonzero
972a102b92
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into pserver_prepare_before_run
7 years ago
typhoonzero
5e6276edc1
fix transpiler bug
7 years ago
Yu Yang
9e3e424ecb
Merge pull request #9299 from reyoung/feature/refactor_batch_norm
...
Shrink batch_norm_grad's inputs
7 years ago
Liu Yiqun
987a32dac3
Merge branch 'develop' into core_inference_fix_run
7 years ago
gongweibao
990d6396fe
Reuduce memory copy when communication between trainer and pserver. ( #9271 )
7 years ago
whs
b594251f89
Merge pull request #9082 from wanghaoshuang/average_model
...
Add model average optimizer for fluid
7 years ago
Kexin Zhao
64c5c8f8b0
Merge pull request #9269 from kexinzhao/softmax_cudnn_fp16
...
Add float16 support to cudnn softmax kernel
7 years ago
Kexin Zhao
b9e6364e3c
Merge pull request #9267 from kexinzhao/new_relu_fp16
...
Add float16 support to relu op
7 years ago
Kexin Zhao
4eaa789730
resolve conflict
7 years ago
Tomasz Patejko
72cc64e40e
Device blobs are created only in training. Added testing attribute
7 years ago
dzhwinter
53c8c36a04
"debug the process"
7 years ago
tensor-tang
7260e3a443
Merge pull request #9214 from jczaja/prv-softmax-mkldnn-operator-PR
...
Softmax MKLDNN FLUID operator
7 years ago
Yancey1989
2a4221ac07
split send op to send_vars and send_barrier
7 years ago
Yu Yang
0760aaf440
Shrink batch_norm_grad's inputs
7 years ago
guosheng
454b0a96be
Remove the extra call of ValidateShape in ReshapeKernel
7 years ago
guosheng
437f7a3279
Resolve conflict according to the latest code
7 years ago
Jacek Czaja
3b95b55f07
- Softmax MKLDNN primitive integration
...
removed diagnostic
- Added Unit tests for Softmax MKLDNN Forward
Added fix for div by 0 to happen in cross_entropy backward
Conflicts:
paddle/fluid/operators/CMakeLists.txt
- Cosmetic fixes to SoftMax MKLDNN fluid operator
Added misssing softmax fluid operator file
Disabled MKLDNN softmax operator by default
Fix to softmax op unittest merge
clang_formater fixes
clang_formatter fixes
- Name changing of softmax mkldnn operator to maintin consistency
across codebase
- updated comment
fix to comment
7 years ago
guosheng
eb12cbe764
Refine reshape_op infershape
7 years ago
Liu Yiqun
0968753454
Enable the test of not creating variables every time.
7 years ago
typhoonzero
1eec926124
updates
7 years ago
typhoonzero
e9d815e32b
prepare and create op before run
7 years ago
Kexin Zhao
ed2bc194c5
Merge pull request #9176 from kexinzhao/batch_norm_fp16
...
Add float16 support to batch norm operator
7 years ago
fengjiayi
809530f418
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_MultiEpochReader
7 years ago
fengjiayi
7c041e48f4
Merge pull request #9182 from JiayiFeng/dev_MultipleReader
...
Multi-threaded reader in C++
7 years ago
typhoonzero
18461d0935
wip
7 years ago
wanghaoshuang
edb4e29ab7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into average_model
7 years ago
Kexin Zhao
b7801b9fcb
small fix
7 years ago
Kexin Zhao
70e7122785
initial commit
7 years ago
Kexin Zhao
d60180af39
inital commit
7 years ago
Kexin Zhao
c1e9b1e37e
Merge pull request #9231 from kexinzhao/elementwise_add_fp16
...
Add float16 support to Elementwise Add op
7 years ago
dzhwinter
e4c35d837d
"add details"
7 years ago
fengjiayi
91b6d60003
Merge branch 'fix_bug_in_recordio' into dev_MultiEpochReader
7 years ago
fengjiayi
2532b922dc
Add more unittests and fix bugs
7 years ago
dzhwinter
26822bd774
"add sequence kernel"
7 years ago
wanghaoshuang
ad63722ed9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into average_model
7 years ago
fengjiayi
f863866471
Add an unitest
7 years ago
武毅
5008020d19
Merge pull request #9154 from typhoonzero/pserver_parallel
...
pserver runs in parallel
7 years ago
fengjiayi
02b7d8bea5
Merge branch 'fix_bug_in_recordio' into dev_MultipleReader
7 years ago
typhoonzero
3666d7c02f
fix num_blocks==2
7 years ago
sabreshao
e50205e744
CMake refine for HIP support.
...
1. Add option WITH_AMD_GPU.
2. Add cmake/hip.cmake for HIP toolchain.
3. Some external module such as eigen may need HIP port.
4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
7 years ago
fengjiayi
a2981f5c50
fix a bug
7 years ago
Yang yaming
381c6a026d
Merge pull request #9100 from pkuyym/fix-9049
...
Enhance sequence_expand operator
7 years ago
Kexin Zhao
d307b5e4a6
Merge remote-tracking branch 'upstream/develop' into elementwise_add_fp16
7 years ago
typhoonzero
139ae08fdf
workable
7 years ago
Kexin Zhao
5271c32d24
Merge pull request #9223 from kexinzhao/dropout_fp16
...
Add float16 support to dropout operator
7 years ago
Kexin Zhao
3da094fd7b
rearrange test
7 years ago
fengjiayi
832deee448
Merge pull request #9178 from JiayiFeng/fix_bugs_in_reader
...
Fix bugs in c++ readers
7 years ago
wanghaoshuang
e01c770c05
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into average_model
7 years ago
wanghaoshuang
d22f4de794
Refine sum_accumulates_op.
7 years ago
yangyaming
2c22552542
Fix some comments and adapt test_machine_translation.py.
7 years ago
fengjiayi
6f7e812bb3
fix bugs
7 years ago
dzhwinter
4ee1c9e60d
"add sequence expand kernel"
7 years ago
yangyaming
2f2c5f5e60
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-9049
7 years ago
Kexin Zhao
4bf168b274
add fp16 kernel for elementwise add
7 years ago
Kexin Zhao
d03dbb97f9
remove AttrType
7 years ago
Kexin Zhao
05ad15832a
initial commit
7 years ago
Xi Chen
9eae086e39
add math_function to softmax's dep list
7 years ago
emailweixu
b3f076a6e4
Merge pull request #9168 from emailweixu/fix_compile
...
Fix compilation for gcc5.4
7 years ago
yangyaming
869a6f9cea
Add python wrapper.
7 years ago
fengjiayi
d9868b0839
Add multi_pass_reader
7 years ago
Tomasz Patejko
2d95527527
Removing WITHIN_CHANNEL algorithm for lrn. CPU lrn operator works only with ACROSS_CHANNELS
7 years ago
Tomasz Patejko
c51c446221
Content of GetExpectedKernelType moved to standalone function
7 years ago
Tomasz Patejko
192cc5dd32
Implementation of MKLDNN LRN
7 years ago
yangyaming
332b665fc7
Enhanced cpp implementation and unit test.
7 years ago
caoying03
a6e64242d8
follow comments.
7 years ago
Yu Yang
9cb8f50302
Complete fetch op
7 years ago
fengjiayi
07d38a9b9a
refine patch
7 years ago
fengjiayi
a571ef382e
fix bugs
7 years ago
Kexin Zhao
446d54f5c3
update
7 years ago
Kexin Zhao
ffa22a5f90
fix scaling param type
7 years ago
Tao Luo
c0421379b7
Merge pull request #9043 from Xreki/core_inference_remove_clone
...
Remove unnecessary clone of program in C++ Executor.Run
7 years ago
caoying03
c87d11a716
Merge branch 'develop' into enhance_reshape
7 years ago
Kexin Zhao
e870947cfd
fix batch norm fp16 param type
7 years ago
wanghaoshuang
92a01d4994
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into average_model
7 years ago
wanghaoshuang
e0b136c0f9
Refine average accumulates op
...
1. Rename inputs and outputs
2. Add some comments
7 years ago
typhoonzero
093e07d39e
need to split var scopes
7 years ago
fengjiayi
3d677b1eca
fix compile errors and make OpenFilesOpMaker derived from FileReaderMakerBase
7 years ago
fengjiayi
550622529c
Add MultipleReader and open_files_op
7 years ago
Kexin Zhao
0a95a44b9a
add python batch norm inference test
7 years ago
Kexin Zhao
df99b16a16
Merge pull request #9167 from kexinzhao/pool2d_fp16
...
Add float16 support for pool 2d operator
7 years ago
Kexin Zhao
39c676e208
initial commit
7 years ago
xuwei06
ab3543e35e
Fix compilation for gcc5.4
...
The error is:
paddle/fluid/operators/math/concat.cc:47:72: error: invalid initialization of non-const reference of type 'paddle::platform::CPUPlace&' from an rvalue of type 'paddle::platform::CPUPlace'
auto& cpu_place = boost::get<platform::CPUPlace>(context.GetPlace());
Should not use reference for cpu_place.
7 years ago
Kexin Zhao
8ebfc153dd
update
7 years ago
Kexin Zhao
3f5705c346
Merge pull request #9148 from kexinzhao/cast_op_fp16
...
Add float16 support for cast op
7 years ago
Kexin Zhao
bfbc25bdb8
add fp16 pool2d support
7 years ago
yangyaming
3b03e3748d
Refine some ENFORCE.
7 years ago
yangyaming
58730ba131
Enhance unit test.
7 years ago
yangyaming
bf3f56e899
Finish adaption for backward.
7 years ago
sabreshao
45c988d86a
Demostration of cmake refine for HIP support.
...
1. Add option WITH_AMD_GPU.
2. Add cmake/hip.cmake for HIP toolchain.
3. Some external module such as eigen may need HIP port.
4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
7 years ago
Liu Yiqun
371c53f88c
Add profiling event in feed, fetch and load op.
7 years ago
Yu Yang
5e87cd7574
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
Yu Yang
8b397d1602
Make recordio file reader thread-safe by default
7 years ago
Yu Yang
6f0dfd89a4
Single GPU ParallelExecutor complete
7 years ago
typhoonzero
b8f4c8599e
pserver runs in parallel
7 years ago
Kexin Zhao
8e7310146f
Merge pull request #9143 from kexinzhao/numpy_conv2d_pool2d_fp16
...
Add float16 support for cudnn conv2d
7 years ago
Kexin Zhao
f3c5e81556
add fp16 for cast op
7 years ago
Tao Luo
a448fbe9e1
Merge pull request #9134 from putcn/fix-selected-row-dep
...
add math_function to selected_rows_functor dependency list
7 years ago
Tao Luo
20be8e7e33
Merge pull request #9104 from ranqiu92/doc_dir
...
change the dir of docs
7 years ago
qingqing01
7c1a0b77a0
Delete the detection_output_op, which had been split into several operators. ( #9121 )
7 years ago
Kexin Zhao
e967d19b0a
add more tests
7 years ago
Kexin Zhao
a13ec3432a
fix test error
7 years ago
Kexin Zhao
e4de5dc347
add conv2d fp16 support
7 years ago
Xi Chen
d20c6eb6de
add math_function to selected_rows_functor dependency list
7 years ago
qingqing01
1cd700d8e8
Fix bug in LRN operator. ( #9124 )
7 years ago
ranqiu
64775126f3
change the dir of docs
7 years ago
qingqing01
b5a16dca20
Fix a critical bug in softmax_with_cross_entropy_op backward. ( #9120 )
...
* Fix a critical bug in softmax_with_cross_entropy_op, which will lead to the wrong gradients.
* Enhance unit testing.
7 years ago
Yu Yang
d84ddcf123
Stash
7 years ago
Thuan Nguyen
1e4c504e60
Implement Select OP ( #9088 )
...
* Fix old documentation for channel_recv
* Initial design of CSP select
* Redesign channel implementation for Select Op
* Remove unecessary header
* Initial checkin of select op, currently will read all the conditional_op in the cases block and also pull out all channels involved in the select.
* Init python select op API
* Python select bug fix when checking op creates block
* Add case_to_execute as (a) input to select, (b) into the passed inputs into the select op
* Add in addition code for select op
* Init fibonacci test from python
* implement fibonnaci sequence test
* update fib unit test
* Improve select test cases
* Shorten non-pep-8-ed lines
* Add methods on channel needed by select op
* Fix compile issues, finish implementation, still need to debug code
* Fix issue with fibonncci test, it works now!
* Change QueueMessage callback to take in an ChannelAction enum, fix select unit test
* Fix case attributes
* Fix issue with select control flow
* Make cases - previously on each selectcase conditional_block - attributes to select
* Use class constants for type of channel
* Change select op to take in "cases" attribute
* return boolean from select callback function to tell Channel if this RECV or SEND should be executed
* Improve attributes and inputs comments on select op
* Fix issues with python unit test
* Assert fibonacci final output
* Fix issue when channel name / channel var is null for "default" case in select op
* Assert base select test output
* Make QueueMessage use shared pointer and modify the order of the callback
* Fixing the order in which the callback is called
* Move channel utility methods to paddle/fluid/operators/concurrency/channel_util
* Create channel_util and move channel util methods
* Fix crash when calling select_op
* Fix deadlock
* Fix issue of channel destructor deadlock
* Fix precommit issues
* Accidentally checked in changes to beam_search_op, reverting change.
* Fix dependency issue in concurrency cmake
* add device_context dependency for concurrency target
7 years ago
Xin Pan
d284cf88e5
Merge pull request #9037 from panyx0718/develop
...
Better timeline
7 years ago
dzhwinter
128adf53cb
[Speed]implement cudnn sequence softmax cudnn ( #8978 )
...
* "add softmax cudnn functor support"
* "add testing"
* "refine cmakelist"
* "sequence softmax forward speed up"
* "add softmax grad"
* "fix sequence softmax test"
* "add double precision'
* "fix softmax test"
* "add softmax cudnn support"
* "fix softmax cudnn test"
* "add softmax to nn.py"
* "fix compile bug"
* "refine cmakelist"
* "fix ci"
* "fix based on comment"
* "fix based on comments"
* "fix ci"
7 years ago
yangyaming
352fa41a16
Finish adapting forward.
7 years ago
wanghaoshuang
d7e5e1f13d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into average_model
7 years ago
Kexin Zhao
e26f1123da
Add fp16 mul op support and bind paddle fp16 to numpy fp16 ( #9017 )
...
* add fp16 mul op support
* small fix
* fix bug
* small fix
* fix PADDLE_WITH_CUDA compiling issue
* reorg code
* test for pybind
* treate as float16 as uint16_t in pybind
* bind np.float16 to paddle float16
* small fix
* clean code
* remove redundancy
* fix mul_op test
* address comments
* small fix
* add is_float16_supported func
7 years ago
dzhwinter
7140071152
"exported scatter to python" ( #9038 )
...
* "exported scatter to python"
* Revert ""exported scatter to python""
This reverts commit 38745a626c3f937bec836c92c98a76deadf0a03d.
* "polish scatter and export to python"
7 years ago
chengduo
11c43e5da3
Merge pull request #9072 from chengduoZH/feature/refine_parallel_do
...
Refine parallel_do_grad
7 years ago
wanghaoshuang
8a645685ce
Add sum accumulator with window for model average
7 years ago
Xin Pan
4840c49b27
Better timeline
7 years ago
chengduoZH
ef28e7deba
refine parallel_do_grad
7 years ago
Yu Yang
48f213e5a1
Merge pull request #8991 from reyoung/feature/shuffle_reader
...
Feature/shuffle reader
7 years ago
Cao Ying
881c5227ab
Merge pull request #8843 from zhouhanqing/Paddle-ReduceProd
...
Add product reduction for reduce op.
7 years ago
武毅
d13ce35875
Feature/send recv can now retry ( #9027 )
7 years ago
dzhwinter
14fe40aaa6
Refine/nccl ( #9009 )
...
* "Refine nccl op"
* "refine code "
* "refine nccl code"
7 years ago
chengduo
788c600e9d
Merge pull request #8932 from chengduoZH/feature/add_concat_rows
...
Enhance look_up_table op
7 years ago
chengduoZH
92e2207e18
refine doc
7 years ago
Yu Yang
164f2382af
Polish code
7 years ago
chengduoZH
ff09b21cd0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_concat_rows
7 years ago
Yu Yang
e13aec601a
Merge pull request #8830 from reyoung/feature/recordio_file_reader
...
Feature/recordio file reader
7 years ago
Yu Yang
f9974a4a12
Make double_buffer reader async
7 years ago
Yu Yang
a8c076e577
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/shuffle_reader
7 years ago
Luo Tao
de13f0eb4e
Merge branch 'develop' into infer_mkl
7 years ago
chengduoZH
b9397b2668
remove concat_rows
7 years ago
QI JUN
7287630e83
Repair nccl op test ( #8575 )
...
* fix nccl op unit test
* fix build error
* format code
* refine nccl related unit test
* fix build error
* add setGPUData
* clean up
* follow comments
* rm test_nccl.cu
* follow comment
* rm wait
7 years ago
caoying03
cf08185145
fix bugs and complete codes.
7 years ago
Yu Yang
225efa671f
Remove dims in base class
7 years ago
QI JUN
f7e9fe57d3
[Memory]More memory optimization policy ( #8690 )
...
* add memopt level
* add opt level for image classification demo
* clean code
* add delete op
* clean code
* test machine translation demo
* clean code
* clean code
* skip fill constant with force cpu
* clean code
* clean code
* refine code
* clean code
* fix bug
7 years ago
Yu Yang
2ea4a5d96c
Polish double buffer reader
7 years ago
kexinzhao
607eec30a8
Merge pull request #8946 from kexinzhao/fix_cuda_arch_fp16
...
Add GPU compute capability check for float16 math function test
7 years ago
Yancey
b5ef315cf1
Fix dist compile error ( #8987 )
7 years ago
qingqing01
b3d26cd3ad
Fix bug in detection_output and mAP calculation in SSD. ( #8985 )
...
* Clipping bbox in the mAP evaluator calculation.
* Fix bug in detection_output and mAP calculation in SSD.
* Fix bug in detection.py.
* Fix bug in test_detection_map_op.py.
7 years ago
Yu Yang
46ae4075ee
Polish ShuffleReader and test
7 years ago
chengduoZH
f1c3ecb2b2
add concat rows
7 years ago
chengduo
685f03762e
Merge pull request #8890 from chengduoZH/feature/fix_bug_of_elementwise
...
Add ElementwiseOpInferVarType for Elementwise_op
7 years ago
Kexin Zhao
3b44b849d3
address comments
7 years ago
fengjiayi
dd1244f3c9
Merge pull request #8943 from JiayiFeng/fix_bugs_in_readers
...
Fix a potential bug in the c++ reader
7 years ago
Yu Yang
7eedced82a
Polish RecordIO
7 years ago
caoying03
a8cdd97ef5
Merge branch 'develop' into enhance_reshape
7 years ago
caoying03
1d4dfc0966
fix bugs.
7 years ago
Yu Yang
cfca8a3a26
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/recordio_file_reader
7 years ago
Yu Yang
fea43077f6
Refine
7 years ago
pzelazko-intel
4730a4be24
MKLDNN pool2d OP kernel added ( #8879 )
...
* MKLDNN pool2d OP kernel added
* conv2d and pool2d MKLDNN kernels renamed
* MKLDNN conv2d kernel refactoring
7 years ago
Kexin Zhao
95de7617eb
fix bug
7 years ago
Kexin Zhao
1998d5afa2
add gpu info func to get compute cap
7 years ago
Kexin Zhao
d400b4192d
fix math function arch mismatch for older GPU
7 years ago
fengjiayi
614c33fb3a
fix a potential bug in the c++ reader
7 years ago
chengduoZH
1509ce6638
enhancement look_up_table
7 years ago
fengjiayi
aa3f5058d3
Merge pull request #8841 from JiayiFeng/dev_double_buffer_for_cpp_reader
...
Basic double buffer for cpp reader
7 years ago
QI JUN
b341bac7e1
Refine cast op ( #8923 )
...
* fix mac build error
* override GetExpectedKernelType for cast op
* fix typo
* add cuda unittest
7 years ago
Yancey
8468037918
Fix sparse update memory error for distributed training ( #8837 )
...
Fix sparse update memory error for distributed training
7 years ago
fengjiayi
35e1e0d521
uses channel to replace the traditional buffer
7 years ago
fengjiayi
b3a11fdf3a
Merge branch 'rm_reader_HasNext' into dev_double_buffer_for_cpp_reader
7 years ago
fengjiayi
6e5736e270
fix a compile error
7 years ago
fengjiayi
4e517881f7
remove HasNext
7 years ago
武毅
9dd34e4169
update unpushed commits for zerocopy grpc ( #8900 )
7 years ago
zhouhanqing
9d78971d8b
Some comments have been modified.
7 years ago
zhouhanqing
3ca968441d
Merge branch 'develop' into Paddle-ReduceProd
7 years ago
kexinzhao
90215b7844
Add float16 GEMM math function on GPU ( #8695 )
...
* test cpu float16 data transform
* add isnan etc
* small fix
* fix containsNAN test error
* add data_type transform GPU test
* add float16 GPU example
* fix error
* fix GPU test error
* initial commit
* fix error
* small fix
* add more gemm fp16 tests
* fix error
* add utility function
7 years ago
武毅
45af8c1e99
Performance/zero copy variable seriralization ( #8839 )
7 years ago
chengduoZH
53d19f5b1e
Add ElementwiseOpInferVarType
7 years ago
ying
d3d16f76f5
enhance reshape operator.
7 years ago
Luo Tao
bc0cfb2283
remove PADDLE_USE_ATLAS
7 years ago
qingqing01
ffda2c414d
Clipping bbox in the mAP evaluator calculation. ( #8872 )
7 years ago
Yu Yang
b536799af0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/recordio_file_reader
7 years ago
chengduo
f3cdeb9a29
Merge pull request #8820 from chengduoZH/feature/refine_elementwise_
...
[Speed] Refine elementwise sub,div,min,max gradient functor
7 years ago
qingqing01
ded34b2c0f
Fix detection_map_op for multi-device. ( #8845 )
7 years ago
Yu Yang
72be7a6151
Complete RecordIO reader op
7 years ago
fengjiayi
b1f647fd6d
fix errors
7 years ago
zhouhanqing
732eebb286
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into Paddle-ReduceProd
7 years ago
zhouhanqing
15306ffdc3
add product reduction for reduce_op
7 years ago
fengjiayi
e8d21b6349
fix an error
7 years ago
fengjiayi
4fb7b96756
Add basic double buffer reader
7 years ago
Luo Tao
49f3f1db07
add back framework_proto depends
7 years ago
Luo Tao
3ddc997182
rename concat_functor to concat, refine CMakeLists based on comments
7 years ago
Luo Tao
1ef97fa7b1
Merge branch 'develop' into math_function
7 years ago
chengduo
84aea8a8a1
Merge pull request #8669 from chengduoZH/feature/concat_op
...
Refine concat_op
7 years ago
pzelazko-intel
8c71adaa8c
MKLDNN conv2d kernel added ( #8451 )
...
* MKLDNN conv2 OP kernel added
* TODOs added
* mkldnn conv2d OP refactor
* CanCUDNNBeUsed and CanMKLDNNBeUsed moved
7 years ago
Yan Chunwei
049383c615
add inplace to reshape ( #8747 )
7 years ago
Yu Yang
42e65a20c6
Merge pull request #8791 from reyoung/feature/extract_reader_ops
...
Extract create_reader_op to three files
7 years ago
Yu Yang
10343123e3
Merge branch 'feature/extract_reader_ops' into feature/recordio_file_reader
7 years ago
chengduoZH
8b30fadac3
refine elementwise sub,div,min,max
7 years ago
Yu Yang
4690b9c987
FIX CI
7 years ago
chengduo
c43995ed36
Merge pull request #8810 from chengduoZH/feature/refine_elementwise_mul
...
[Speed]Refine elementwise_mul_op
7 years ago
kexinzhao
266ccaa843
Integrate float16 into data_type_transform ( #8619 )
...
* test cpu float16 data transform
* add isnan etc
* small fix
* fix containsNAN test error
* add data_type transform GPU test
* add float16 GPU example
* fix error
* fix GPU test error
* add context wait
7 years ago
chengduoZH
a1331f9877
refine elementwise_mul_op
7 years ago
qingqing01
0e1f82fd2a
Fix bug in detection mAP evaluator. ( #8778 )
...
* Fix mAP evaluator bug.
* Fix bug in detection mAP evaluator.
* Fix unit testing.
* Support to set background label index in detection mAP op.
7 years ago
Yu Yang
4d8345e3ac
Extract create_reader_op to three files
7 years ago
whs
f608bb2756
Merge pull request #8651 from wanghaoshuang/fix_pool
...
Add ceil_mode option for pool2d and pool3d
7 years ago
Xin Pan
c7b7291b9f
Merge pull request #8758 from panyx0718/nccl
...
[Speed]Avoid init_nccl for every steps.
7 years ago
Xin Pan
a4d68ed3d8
Add lock
7 years ago
qingqing01
608feea204
Implement detection mAP evaluator wrapper and unify label format between SSD loss and mAP evaluator ( #8736 )
...
* Implement mAP evalutor Python interface.
* Fix unit testing and uniy label format between SSD loss and mAP evalutor.
* Update doc.
7 years ago
chengduoZH
c3864eab99
if axis == 0; directly copy D->D
7 years ago
chengduoZH
131ec276ed
fix bug for big number; float->double and code refine
7 years ago
Xin Pan
d054cfeae6
Avoid init_nccl for every steps.
7 years ago
chengduoZH
82bd82c186
follow comments and refine code
7 years ago
qingqing01
457d012a8e
Enable device automatically switching in mine_hard_examples_op. ( #8706 )
7 years ago
chengduoZH
00e596edbe
get max threads of GPU
7 years ago
typhoonzero
f94a758c16
fix fluid distribute build
7 years ago
Luo Tao
f67275a920
refine operator/math/CMakeLists.txt, seperate im2col from math_function
7 years ago
chengduoZH
60e7ee0611
refine concat_op
7 years ago
qingqing01
9e1ec8c919
Enable device switching automatically for serveral operators ( #8684 )
7 years ago
emailweixu
eac2c3cf36
Merge pull request #8505 from emailweixu/math_op
...
Correctly handling variable with batch dimension for math ops.
7 years ago
Abhinav Arora
0d878e4c09
Add Go_op, Channel_create, channel_close, channel_send and channel_receive ops ( #8593 )
...
* Adding Python boilerplate code for Go op
* Add very basic test case
* Adding the python logic for go routine
* Fix syntax
* Changing test to notest
* Rename Routine to Go
* Combining GoGuard and Go in one class
* Modify test
* Adding fluid close channel
* Fixing __init__.py for calling fluid.go()
* Adding stubs for channel methods and updating test case
* Removing import *
* Adding imports from concurrency
* Initial commit of GO_OP (for varun)
* Creating local scopes and go through them
* Updated go op inputs persistability enforcement
* Add thread execution; compile failing though
* Fix go op
* Cleaned up Go op
* Fix yapf format issue
* Readd warp ctc dir for unit tests
* Updated make_channel, channel_send, channel_recv and channel_close
* Moved thread function to another method, update unit tests
* remove output var
* Add stubs for channel operators
* Updating concurrency with signatures
* Updated the signature with return status
* Fixed dtype in variables
* Updating stub of ChannelSend + add infershape
* Updating stub of ChannelRecv + add infershape
* Updated signature
* Adding the channel_create operator
* Merge channel send+receive ops
* Update concurrency tests using all operators
* Updating the create op with ChannelHolder
* Fix issues with channel_create_op
* Add the implementation for channel_close op
* Add channel close operator, fix channel close op
* Adding the channel_send op
* Comment channels C++ and Python code
* Concurrency python api comment fix
* Update unit test to add Status variable
* Adding channel receive operator
* Update concurrency test to demonstrate a complete CSP flow
* Fix clang-format issues
* Fixed "Out" parameter name
* Fixing merge conflict in framework.py
* Add channel ops to framework.py no_kernel_op_set
* Seperating channel_send and channel_recv operators
* Documenting capacity type
* Update concurrency test to create go block as child block of main program
* Changing set status implementation
7 years ago
Yancey
718642e93f
Merge pull request #8659 from Yancey1989/fix_dist_bug
...
Registry var type infer in split_selected_rows op
7 years ago
chengduo
e9f2033175
Merge pull request #8539 from chengduoZH/feature/refine_elementwise_op_function.h
...
Refine Sum in elementwise_op_function
7 years ago
Yancey1989
7bd16fe13b
registry var type infer
7 years ago
Yuan Gao
e84615bae6
Fix box coder op ( #8647 )
...
* fix ssd problems
* fix box decoder op
* fix dimension problem in detection tests
* update detection doc
* Update detection doc
* Update detection doc
* update detection doc
* update detection doc
7 years ago
wanghaoshuang
0ffec514cd
Fix comments
7 years ago
wanghaoshuang
1e510d9914
Add ceil_mode option for pool2d and pool3d
7 years ago
chengduo
cf883d9caf
Merge pull request #8613 from chengduoZH/feature/fix_bug_conv_op
...
Fix conv_op bug
7 years ago
qingqing01
f449180b1c
Register more data type for reshape operator. ( #8617 )
7 years ago
chengduoZH
a779b42450
follow comments
7 years ago
xuwei06
e9b8ebf42c
Correctly handling variable with batch dimension for math ops.
...
When the second argument contains batch dimension, the axis should be 0.
Also makes elementwise ops more tolerant at handling tensors with trailing
singular dimensions.
7 years ago
xuwei06
7d56c6d078
Adding Dim<0>
...
Dim<0> is for scalar (rank-0 tensor). Adding Dim<0> can simplify
a lot of code.
7 years ago
chengduoZH
b5c920920c
fix conv_op bug
7 years ago
Luo Tao
6dd3a61b2f
combine batch_size_like.cc into batch_size_like.h
7 years ago
Tao Luo
1ac31d3d08
Merge pull request #8591 from chengduoZH/feature/refine_cmake_for_cudnn
...
Refine cmake for cudnn op
7 years ago
chengduoZH
62fe2f28a1
follow comments
7 years ago
qingqing01
4948f7b3fe
Enhance bipartite_match_op to support argmax matching after bipartite matching. ( #8580 )
...
* Enhance bipartite_match_op to support argmax matching after bipartite matching.
* Fix typo error.
7 years ago
chengduoZH
16fc5e3893
refine cmake for cudnn
7 years ago
武毅
0250e8083d
Merge pull request #8586 from Yancey1989/fix_dist_unittest
...
Fix send_recv unit test
7 years ago
dzhwinter
34605d2641
accelerate the cuda concat op, avoid many times copy ( #8585 )
...
* "try enhance concat op"
* "enhance the concat operator"
7 years ago
Yancey1989
7a1d6ae5f6
Fix send_recv unit test
7 years ago
typhoonzero
a50230add6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_raw_var_type
7 years ago
chengduoZH
b8938b448c
refine Sum
7 years ago
chengduoZH
a82883922e
follow comments
7 years ago
chengduoZH
22b9ab052d
refine Sum
7 years ago
typhoonzero
9b5be6ef43
fix short connection again
7 years ago
qiaolongfei
28d07e3cb8
add python part of compare op
7 years ago
qiaolongfei
d4e3495cf5
add larger_than and larger_equal op and kernel
7 years ago
qiaolongfei
bad01596f9
rename register macro
7 years ago
Yu Yang
7a9098a60e
Add block.fwd_block_id ( #8489 )
...
* Add block.fwd_block_id
* fix bug in memory optimization transpiler
* Change DFS to BFS
* Add comments
7 years ago
chengduo
0e187bc93e
fix get_mid_dims annotation ( #8490 )
7 years ago
Yu Yang
88c22e9d1a
Speed up elemwise grad ( #8402 )
...
* Speed up elemwise grad
* Fix bug
* Add macro for MAX_BLOCK_DIM
7 years ago
武毅
c490f1b38b
Merge pull request #8049 from typhoonzero/no_counter_on_pserver
...
Enhancement/transpiler rename grad vars to add trainer id, so RPC call can be retried.
7 years ago
Yang Yang(Tony)
3c2cafbe1d
fix parallel do hard coded empty var name ( #8469 )
7 years ago
Yang Yang
ec01f635f5
merge develop
7 years ago
helinwang
6566d7c740
Merge pull request #8458 from tonyyang-svail/rename_output
...
change outputsize func name
7 years ago
Abhinav Arora
c7ad26d6a4
[WIP] Move DataType enum inside VarType ( #8447 )
...
* Move Pod Types from DataType enum to Type enum
* Fixed data_type.h
* Fix type in TensorDesc
* Add comment to framework.proto
* Fixed type in data_type.h
* Updated format of type in data_type.h
* Fix var_desc.h
* Fix op_kernel_type.h
* Fixed data_type_transform_test.cc
* Fix operator.h
* Fixed data_type_transform.cc
* Fixed op_kernel_type_test.cc
* Fix operator.cc
* Fixed data_layout_transform_test.cc
* Fix var_desc.cc
* Fixed assign_value_op.cc
* Fixed assign_value_op.h
* fixed protobuf.cc
* Fix data_layout_transform_test.cc and op_kernel_type_test.cc
* Fixed rnn_memory_helper_op.cc
* Fix progrma_desc_test.cc
* Fixed fill_constant_batch_size_like_op.cc
* Fix operator_test.cc
* Fixed fill_constant_op.cc
* Fixed gaussian_random_op.cc
* Fixed uniform_random_op.cc
* Fixed edit_distance_op.cc
* Fixed fill_constant_batch_size_like_op.cc
* Fixed rnn_memory_helper_op.cc
* Fixed chunk_eval_op.cc
* Fixed assign_value_op.cc
* Fixed assign_value_op.h
* Fixed cast_op.h
* Fixed cast_op.h
* Fix fill constant op
* Fixed clang for assign_value_op.cc
* Fix one_hot_op.h
* Fix one_hot_op.cc
* Fix fill_op.cc
* Fixed sum_op.cc
* Fixed sum_op clang
* Fix uniform_random_op.cc
* Fix gaussian_random_op.cc
* Fix backward.cc
* Fix protobuf.cc
* Fixed prune_test.cc
* Fixed op_registry_test.cc
* Fix data_device_transform_test.cu
* Fix travis error
* Fixed one_hot_op.cu
* Fixed op_registry_test.cc
* Fixed nccl_op.cc
* Fixing python tests
* Revert "Fixing python tests"
This reverts commit fccaa4c5818ed9f379ea1ce4315066cc78076c64.
* Fixing Pybind to remove data type
* Fixing tensor.py
* Updated the new files:
* Resolve error in merge conflict of fill_constant_batch_size_like_op.cc
7 years ago
Yang Yang
cb06337f9e
change outputsize func name
7 years ago
Yang Yang
5229ccbdc7
merge develop
7 years ago
Yang Yang
1d9fd1c006
pass test_recognize_digits
7 years ago
emailweixu
4f122c0760
Remove incorrect statement in compare_op.h ( #8416 )
...
The type of tensor z should be bool. And there's no need to call mutable_data because ElementwiseComputeEx will do it.
7 years ago
emailweixu
6752b06f8c
Generating random numbers with given batch size ( #8337 )
...
* Generating random numbers with given batch size
uniform_random_batch_size_like_op
gaussian_random_batch_size_like_op
* More comments about random seed.
* Move test_*_random_batch_size_like_op to unittests
7 years ago
Yi Wang
cfffb1a362
Update tensor_util.h ( #8422 )
...
* Update tensor_util.h
* Update with moved TensorDesc
* Fix tensur_utils.cu
* Update
* Update
* Update
* Update
* Make tensor_util.cu a symbolic link
7 years ago
Yang Yang
3f09620ef2
pass compile
7 years ago
Yang Yang
e021ad674e
Merge remote-tracking branch 'upstream/develop' into backward_on_parallel_do
7 years ago
typhoonzero
1494673dcb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into no_counter_on_pserver
7 years ago
typhoonzero
8acad27e8d
refine code
7 years ago
QI JUN
8b01546fe4
fix compile warning ( #8430 )
7 years ago
Yang Yang
7129fa3c90
merge develop
7 years ago
emailweixu
3beafff282
Merge pull request #8415 from emailweixu/print_op
...
Make print_op able to show the value of bool tensor
7 years ago
Abhinav Arora
fcadb45251
Separate VarType from VarDesc in framework.proto and fix all related compiler errors ( #8414 )
...
* Refine Type system
* Fixing type inference
* Fixed create_reader_op.cc
* Fix var_desc.h
* Fixed executor.cc
* Fix shape_inference.h
* Fixed create_reader_op.cc
* Fix tensor_util.h
* Fixed var_type_inference_test.cc
* Fix shape_inference.cc
* Fixed sum_op.c
* Fixed read_op.cc
* Fix var_type.h
* Fixed beam_search_decode_op.cc
* sendrecvop_utils.cc
* Fix operator.cc
* Fixed lookup_table_op.cc
* Fixed op_desc.cc
* Fixed get_places_op.cc
* Fixed lod_rank_table_op.cc
* Fixed beam_search_op.cc
* Fix var_desc.cc
* Fixed lod_tensor_to_array_op.cc
* Fixed while_op.cc
* Fix program_desc_test.cc
* tensor_array_read_write_op.cc
* Fix assign_op.cc
* Fix executor.cc
* Fix protobuf.cc
* Fix protobuf.cc
7 years ago
Yang Yang
da97d9dfd6
merge develop
7 years ago
xuwei06
004df46f28
Make print_op able to show the value of bool tensor
...
And some minor fixes on comments.
7 years ago
qingqing01
24509f4af9
Fix the grammar in copyright. ( #8403 )
7 years ago
武毅
c0fc50d5f3
Merge pull request #8409 from typhoonzero/fix_grpc_short_conn
...
Fix grpc short connection
7 years ago
dzhwinter
07923ba006
Memory/dropout4 ( #8407 )
...
* "merge random generator kernel and mul"
* "fix dropout"
7 years ago
typhoonzero
0d4d9c4e13
fix grpc short connection
7 years ago
qingqing01
057efd1709
Implement multibox loss wrapper for SSD in Python API. ( #8385 )
...
* Implement multibox loss wrapper in Python API.
* Add some wrappers for SSD detection.
* Fix conflicts.
* Add unit testing for SSD loss wrapper.
* Update doc in Python API.
* Refine unit testing.
* Add more unit testing and update some interface arguments.
7 years ago
typhoonzero
4b91cb5289
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into no_counter_on_pserver
7 years ago
武毅
9942565fe7
Merge pull request #8386 from typhoonzero/fix_dist_transpiler_develop
...
Fix dist transpiler develop
7 years ago
typhoonzero
dca9941e4b
pass size when copy
7 years ago
Yu Yang
5046869e84
Merge pull request #8287 from tonyyang-svail/operator_set_device
...
Correctly handle cuda place for operators
7 years ago
chengduo
7757a8ad7c
Merge pull request #8265 from chengduoZH/feature/add_prior_box_py
...
Add Python interface of prior_boxes
7 years ago
qingqing01
8a0dd2409e
Expose softmax_with_cross_entropy and smooth_l1 into Python API. ( #8375 )
...
* Add softmax_with_cross_entropy and smooth_l1 in Python API.
* Fix doc format.
7 years ago
Wang Hao
a824da91e6
Merge pull request #6588 from wanghaox/detection_map
...
detection map evaluator for SSD
7 years ago
Qiao Longfei
e9d3099145
Merge pull request #8357 from jacquesqiao/override-compare-op-in-python
...
override comparison operators in Python for Variable
7 years ago
wanghaox
91a2188301
update detection_map
7 years ago
typhoonzero
9a05c90750
fix StridedNumelCopyWithAxis
7 years ago
chengduoZH
5ccab2dc65
remove conflict
7 years ago
Yancey
caf9a09d7b
Merge selected rows with dynamic variable count ( #8023 )
...
* dynamic send/recv selected rows
* update by comment
* fix by comment
7 years ago
typhoonzero
11bcb43a44
fix merge issue
7 years ago
typhoonzero
21071f7106
no create trainer var on listen_and_serv
7 years ago
Yancey
525a4fdae8
Merge pull request #8270 from Yancey1989/improve_concat_split_op
...
Improve split and concat op
7 years ago
Tao Luo
b56f4a4ee2
move code from /paddle/string to /paddle/fluid/string ( #8363 )
7 years ago
Yang Yang
0e2deaa5fd
Merge remote-tracking branch 'pr/8364' into backward_on_parallel_do
7 years ago
whs
274f4e94af
Merge pull request #8334 from wanghaoshuang/fix_seq_expand
...
Fix output dims of sequence expand op
7 years ago
Yancey
279aa626ab
Move learning rate and releated op to pserver ( #8209 )
...
* dist train support lr decay
* update by comment
* revert elementwise method creator
* delete comment
7 years ago
wanghaox
006ef1fd7a
migrate detection_map code directory
7 years ago
qingqing01
72bcf72c66
Refine target_assign_op to unify the classification and regression targets assigning. ( #8326 )
...
* Refine target_assign_op to unify the classification and regression targets assignment.
* Fix the unit testing.
* Fix conflicts.
7 years ago
typhoonzero
b0096361c7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into no_counter_on_pserver
7 years ago
qiaolongfei
6f78cb9969
add not_equal
7 years ago
Yancey1989
235332544c
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into improve_concat_split_op
7 years ago
qingqing01
d38b869438
Fix the input dimension for multiclass_nms_op. ( #8232 )
7 years ago
chengduoZH
282dadb0c6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prior_box_py
7 years ago
Yang Yang
3067114f3a
clean up
7 years ago
Yang Yang
cd9e660d4b
merge develop
7 years ago
Yi Wang
40c7972d46
Merge branch 'develop' of http://github.com/paddlepaddle/paddle into operator_set_device
7 years ago
Yi Wang
a51c8ccc16
Merge branch 'develop' of http://github.com/paddlepaddle/paddle into fix_seq_expand
7 years ago
Luo Tao
77f04fd97a
move paddle/pybind/pybind.h to paddle/fluid/pybind/pybind.h, and cancel the test_parallel_op temporary
7 years ago
Yi Wang
a6c1bffa47
Merge with upstream
7 years ago
Yi Wang
fc374821dd
Correct #include path
7 years ago
Yi Wang
90648f336d
Move file to fluid/; Edit CMakeLists.txt
7 years ago