Commit Graph

15217 Commits (5eb81fe595a95758ee01450f600850273c97a197)

Author SHA1 Message Date
baojun 910196524d NGraph Added dropout and dropout_grad to ngraph test=develop (#17320)
6 years ago
mozga-intel b189480734 Ngraph Enable gather operator test=develop (#17296)
6 years ago
lvmengsi 4ef631013c Double backward sqrt (#17387)
6 years ago
JesseyXujin 829fcc98fb Fix some APIs' example
6 years ago
Zeng Jinle eab34b2df6 fix_dygraph_mem_leak, test=develop (#17396)
6 years ago
lvmengsi 5d1ac41b00 Double backward reduce mean (#17372)
6 years ago
jerrywgz 0cae5a36b6 enhance generate mask labels, test=develop (#17380)
6 years ago
Kaipeng Deng bd9bef5a4e add elementwise_add_grad_grad op (#17366)
6 years ago
jerrywgz 1c6d064627 add collect fpn proposals op,test=develop (#16074)
6 years ago
Kaipeng Deng 60be66e2c0 support fc_op double grad (#17317)
6 years ago
Zhen Wang ad8bbe587e Fix some api example codes' bugs and these APIs include load_inference_model, load_vars, save_vars, L1DecayRegularizer and L2DecayRegularizer. (#17324)
6 years ago
Tao Luo 68ec0a6f74 make parallel_executor support FLAGS_use_mkldnn (#17341)
6 years ago
liuwei1031 0863599323 Fix the uninitialized gru_value.output_value. (#17197)
6 years ago
zhoukunsheng 2ff7ea3337 Expose sign op (#17117)
6 years ago
tianshuo78520a f0acc36684 test=develop (#17357)
6 years ago
Yihua Xu 218d8d8f73 Optimize the computing kernel of sequence_reverse operator (#17349)
6 years ago
Yiqun Liu dcda20233c Optimize the elementwise op using eigen (#15494)
6 years ago
Jiabin Yang 4624d7c642 test=develop, add gradient sort backward strategy (#17125)
6 years ago
qingqing01 1d0ba5e815 Fix the example code in some Python API (#17333)
6 years ago
Kaipeng Deng 8bae8590ac add double grad for elementwise_mul op (#17255)
6 years ago
Kaipeng Deng 11d3a38f25 add double grad for square op (#17173)
6 years ago
Jiabin Yang 31536016ea test=develop, test=document_preview, fix 13 api doc and code (#17293)
6 years ago
chengduo bc833945a4 Add DropLocalExeScopes in ParallelExecutor (#17297)
6 years ago
zhoukunsheng d4b67e1692 Add Where Op(#16793)
6 years ago
zhoukunsheng 1bfff02047 Add Diag Op(#17027)
6 years ago
zhaoyuchen2018 8a2caacdbc improve gru unit performance. (#16338)
6 years ago
SunGaofeng ddb24d48c5 test=develop (#17322)
6 years ago
qingqing01 e32c9888f5 Double backward of conv2d. (#17211)
6 years ago
Zeng Jinle 5e5e7b3305 fix data_type error message (#17312)
6 years ago
Zeng Jinle fff270eacd follow comments,test=develop (#17273)
6 years ago
Zhaolong Xing 7a3bb061d8 fix: (#17279)
6 years ago
xiaoting 50ad9046c9 add import, test=develop (#17229)
6 years ago
zhoukunsheng 4292bd8687 Mod floordiv (#17251)
6 years ago
guru4elephant 5d6a1fcf16 fix infer_from_dataset and train_from_dataset (#17243)
6 years ago
chengduo 516317cf91 use sync copy (#17291)
6 years ago
Huihuang Zheng 2c4462711f Fix API example code of save_inference_model (#17274)
6 years ago
xiaoting 9ed4aaada4 modified formula for Lrn (#17281)
6 years ago
zhaoyuchen2018 792443ef23 Refine elementwise kernel. (#16952)
6 years ago
lujun e388a1fb66 Repair api example (#17221)
6 years ago
Yiqun Liu 6b84688ba2 Optimize the cuda implementation of sum_op (#17283)
6 years ago
chengduo db5e74ab95 update assert (#17282)
6 years ago
Hongyu Liu c3195de522 Fix concat shape check (#17247)
6 years ago
lvmengsi dab71e8d97 Fix api example (#17231)
6 years ago
whs 7d7e29957f Fix bp of roi perspective transform op. (#17216)
6 years ago
baojun 7bd1d03ee5 Adding lrn op for ngraph engine (#17189)
6 years ago
Wojciech Uss 984aa90583 improved unit test output (#17266)
6 years ago
chengduo 8f534696b7 Polish Executor and Compiler doc (#17262)
6 years ago
tianshuo78520a dd86b40058 document_preview (#17166)
6 years ago
gongweibao 91784f8ec3 Fix code in document. (#17237)
6 years ago
chengduo 04bd413acb Code Clean: Move all pass to paddle::framework::ir (#17228)
6 years ago
Huihuang Zheng 648320bb6c Fix some data and reader related API code (#17202)
6 years ago
Zeng Jinle f2fa3f7300 fix api doc,test=develop (#17241)
6 years ago
Zeng Jinle 4f8594088d Enhance inplace/mem-opt pass and enhance softmax_with_cross_entropy op inplace (#17225)
6 years ago
baojun e782b54b9c update sofmax with axis arg test=develop (#17190)
6 years ago
tensor-tang 71f0c6d5bd fix api doc of hash, relu, concat, argmin, argmax, argsoft and all activations (#17235)
6 years ago
Zeng Jinle 6fafd37e12 fix retry_allocator (#17245)
6 years ago
Tao Luo ff1661f12a remove unused FLAGS_warpctc_dir (#17162)
6 years ago
Kaipeng Deng a71d8fdb87 Softmax_cross_entropy op add axis (#16806)
6 years ago
songhao c2e20e2a29 fix build warning like 'comparison between signed and unsigned (#17240)
6 years ago
Zhen Wang a914d9b116 Quant output scale (#17215)
6 years ago
zhaoyuchen2018 32b62c25af optimize sum op (#16820)
6 years ago
石晓伟 a72dbe9abf Cherry-pick benchmark related changes from release/1.4 (#17156)
6 years ago
Tao Luo 16922e0093 fix api_example of tree_conv (#17239)
6 years ago
jerrywgz ef66baedc0 Refine api doc (#17230)
6 years ago
Leo Zhao 54636a1982 call SetNumThreads everytime to avoid missing omp thread setting (#17224)
6 years ago
Yibing Liu 6b0f27e802 Fix some APIs' example (#17214)
6 years ago
ruri 5817077c99 Fix unexecutable API examples (#17218)
6 years ago
jerrywgz cc95a7516c fix distribute fpn proposals, test=develop (#16152)
6 years ago
Tao Luo 9ec4615deb fix profiler and name_scope API examples (#17212)
6 years ago
Zeng Jinle c5eeecca7c Fix tensor_py.h (#17195)
6 years ago
Zeng Jinle ee2028a110 Add use_cuda to inplace pass (#17205)
6 years ago
chengduo 950aec55fd It doesn't need sync when fetch_list nit not empty (#17201)
6 years ago
jerrywgz a72907bbf4 Enhance concat op to support empty input. (#17015)
6 years ago
wopeizl 83c4f7721f use two GPUs to run the exclusive test test=develop (#17187)
6 years ago
chengduo 3c6ab799cd Remove unnecessary set_devices (#17158)
6 years ago
guru4elephant f938ccec62 remove async executor python api to fix document (#17174)
6 years ago
Zeng Jinle 5dfe2ab9e8 Fix mem leak when converting Tensor to numpy array (#17182)
6 years ago
Huihuang Zheng e4a5332416 Fix a typo in gpu_info.cc (#17175)
6 years ago
tensor-tang 79ed1c76cd fix bn fuse vardesc and add model saver (#17143)
6 years ago
Zeng Jinle 4e1bc6e805 Rewrite inplace pass and fix gc bug (#17126)
6 years ago
Zeng Jinle 08773b6069 fix reader default stream,test=develop (#17106)
6 years ago
xiaoting bc48453b73 polish the label_smooth (#17138)
6 years ago
Leo Zhao bf4b21fa3d fix assertion failure issue when test_analyzer_bert uses ngraph (#17148)
6 years ago
tangwei12 deb510d451 cvm op feature (#17081)
6 years ago
wopeizl 3acb3635c2 1. move the API check into CPU process (#17110)
6 years ago
tianshuo78520a 92ce445227 Supplementary monitoring file reason explanation (#17131)
6 years ago
Zeng Jinle 28d69d710a Refine dropout gpu memory (#17095)
6 years ago
Huihuang Zheng b9494058b3 Use CudnnWorkspaceHandle in exhaustive search (#17082)
6 years ago
tianshuo78520a 2192e7bb61 Path flag (#17105)
6 years ago
xiaoting 7da7881c0e Detailed coordinate description for yolov3 loss (#17007)
6 years ago
chengduo 794a195881 fix fuse optimizer ops (#17102)
6 years ago
ceci3 258e000be6 test=develop, double backward leaky_relu (#17067)
6 years ago
Kaipeng Deng 10c487eb21 fix interpolate cu. test=develop (#17101)
6 years ago
Tao Luo aca60e9a20 remove unnecessary prepare_data (#17080)
6 years ago
whs 55ce36e981 Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)
6 years ago
Zeng Jinle 842ded14b0 fix reference_count_pass,test=develop (#17060)
6 years ago
Yibing Liu beda78258f Init mixed precision training interface (#16856)
6 years ago
Yan Xu 0b07eef118 ParallelDyGraph with GPU collective mode (#16827)
6 years ago
Tao Luo d9cd989825 Merge pull request #17048 from luotao1/fix_runtime_cache_bug
6 years ago
wopeizl f5d6937fe1 specify the cuda arch name and bin to decrease the compile time for i… (#17020)
6 years ago
chengduo cc31681687 use fast executor as default (#17044)
6 years ago
chengduo a2be4b4d91 Add fuse momenutum ops (#16745)
6 years ago
guru4elephant 03d469ad98 Merge pull request #17005 from wopeizl/fix_ncclwrapper_win1
6 years ago
tangwei12 13295d90d9 load persistables with selected rows, test=develop (#17047)
6 years ago
luotao1 490e746269 fix runtime_context_cache bug when gpu model has an op runs only on cpu
6 years ago
Zeng Jinle 0c335dcd2c Make conv cudnn workspace size configurable (#17036)
6 years ago
jerrywgz ea3504c7ec Merge pull request #17017 from jerrywgz/fix_potential_hung
6 years ago
qingqing01 c1c2633a63 Support backward of backward for Relu and add a new gradient checker by comparing theoretical and numerical Jacobian. (#16862)
6 years ago
tangwei12 45136b1b41 fix bug in save, test=develop
6 years ago
jerrywgz 47013af0a6 Merge pull request #17011 from jerrywgz/enhance_generate_proposal_labels
6 years ago
tianshuo78520a 73a360b504 Cmakelists fix (#17018)
6 years ago
liuwei1031 a770ce0615 add doc for memory_optimize, test=develop (#17010)
6 years ago
wopeizl d9991dccdd add parallel build script to ci … (#16901)
6 years ago
jerrywgz b2df6de860 fix potential hung in generate proposals, test=develop
6 years ago
Zeng Jinle 24923f7604 fix py_reader demo (#16997)
6 years ago
qingqing01 ea42e431f8 Speed unit testing. (#16978)
6 years ago
jerrywgz d3a66fc616 enhance generate proposal labels, test=develop
6 years ago
wopeizl 51a0243a56 fix nccl wrapper on windows
6 years ago
Zeng Jinle 1202d3fc74 Refine model gpu memory (#16993)
6 years ago
Yibing Liu 3c375751f8 Support seq len equal to 0 in sequence ops (#16935)
6 years ago
Tao Luo c017025531 Merge pull request #16981 from luotao1/disable_runtime_context_default
6 years ago
Yibing Liu 36c05d36ab Check some shapes only in runtime (#16919)
6 years ago
Tao Luo aa7b975bf6 disable runtime_context_cache pass by default
6 years ago
Zhaolong Xing 27cd3efdd1 Merge pull request #16969 from NHZlX/fix_trt_anakin_compile_rely
6 years ago
tianshuo78520a 3242e88b70 fix cmakelist detecting problems (#16944)
6 years ago
jiaqi 8bcba3db84 Merge pull request #16896 from xjqbest/develop
6 years ago
nhzlx bc6b0ca1f4 fix trt anakin subgraph compile rely
6 years ago
guru4elephant bbc6c5714f Merge pull request #16887 from guru4elephant/add_nccl_context_pybind
6 years ago
gongweibao cbdb8a17b1 Polish DGC code (#16818)
6 years ago
lujun dbf66dd034 Merge pull request #16954 from junjun315/fix-dygraph-checkpoint
6 years ago
Tao Luo aa9caa1691 Merge pull request #16951 from luotao1/reduce_ci_time
6 years ago
Guo Sheng 9f1d4a152b Merge pull request #16902 from guoshengCS/refine-infer-shape
6 years ago
Guo Sheng caf2848356 Merge pull request #16898 from Superjomn/fix/logical_op_infershape
6 years ago
lujun a7c11979ba fix dygraph save/load checkpoint error, test=develop
6 years ago
Tao Luo bc037c13c7 use multi-thread to speedup CI tests
6 years ago
tangwei12 2b61db07d1 fix sampling id op bug (#16909)
6 years ago
Tao Luo 5b1565a7be Merge pull request #16875 from lidanqing-intel/lidanqing/improve_preprocess_script
6 years ago
Kevin c474e7ddf5 fix overflow by int32 mul test=develop (#16794)
6 years ago
Hongyu Liu baf60e3a27 Merge pull request #16907 from xuezhong/fix_infershape_bug2
6 years ago
Yan Chunwei 8cff2b4231 Update logical_op.cc
6 years ago
Hongyu Liu 40be9590d4 Merge pull request #16897 from velconia/fix_split_lod_tensor_op_infer_shape
6 years ago
Hongyu Liu d68fb792f8 Merge pull request #16890 from colourful-tree/dev
6 years ago
Hongyu Liu ad2a2bb063 Merge pull request #16913 from phlrain/fix_bpr_loss
6 years ago
Hongyu Liu 8bd549bb68 Merge pull request #16861 from tensor-tang/refine/infershape
6 years ago
Hongyu Liu 9d5d44f939 Merge pull request #16840 from phlrain/fix_shape_check_many
6 years ago
dongdaxiang 2ab2869c2d fix GPU compile error problem
6 years ago
dongdaxiang 466d177d09 add pybind dependency
6 years ago
SunGaofeng 0508c9869c Merge pull request #16853 from SunGaofeng/affine_modify
6 years ago
tangwei12 008fd785fd fix/positive negative pair op (#16895)
6 years ago
Hongyu Liu d5a7c09856 Merge pull request #16798 from phlrain/softmax_cross_support_high_rank
6 years ago