Commit Graph

1317 Commits (a3b26e85288fe38b52b06445de8a05eb4ea6e6d4)

Author SHA1 Message Date
Xin Pan d24f1f0aa4 Current scope needs to be thread-safe for training
7 years ago
Yan Chunwei c8744d118d fea/infer executor and concurrency performance issue bug fix (#13451)
7 years ago
chengduo 5175b3cb2b Add GraphChecker (#13580)
7 years ago
Xin Pan 7cd2761736 Merge pull request #13416 from panyx0718/ir
7 years ago
Yu Yang 0be1582df0 Merge pull request #13525 from reyoung/fix_mixed_vector
7 years ago
Jacek Czaja d5114c60b0 - Reviewers suggesstions to fused_embedding_fc_lstm_op
7 years ago
Jacek Czaja 7ab5626dee - Added initial pass for embedding-fc-lstm
7 years ago
Yan Chunwei 9e8d372ff4 hide attention lstm fuse (#13615)
7 years ago
Wu Yi 10dffc68cd Merge pull request #13618 from typhoonzero/revert_13530
7 years ago
tangwei12 97cf1eb6d7 Add distributed unit tests about text_classification/simnet-bow/ctr (#12812)
7 years ago
typhoonzero a4f7696a18 Revert "Some trivial optimization (#13530)"
7 years ago
tensor-tang 6938e6cf06 Merge pull request #13603 from tensor-tang/refine/peephole
7 years ago
Wu Yi 16e73e0d28 hide operator API (#12543)
7 years ago
tensor-tang 209e9c3db1 refine peephole
7 years ago
chengduo 1d91a49d2f Some trivial optimization (#13530)
7 years ago
Tao Luo d0000082c4 Merge pull request #13552 from sfraczek/sfraczek/conv-relu-update
7 years ago
Sylwester Fraczek e5d1bd1e93 remove unused variable nodes2delete
7 years ago
Sylwester Fraczek a49aa4dac9 make bias unnecessary for ConvRelu fuse
7 years ago
Sylwester Fraczek 493ef0c8df do not remove conv node just rewire the output
7 years ago
Sylwester Fraczek 667b661786 updated the test
7 years ago
Xin Pan f117feab0c modify comments
7 years ago
Yu Yang 7119d6c3cf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_callstack
7 years ago
Xin Pan c2b3838cf5 add some comments
7 years ago
Xin Pan 0d9ee0dced fix resolve conflicts
7 years ago
Xin Pan b43e49fa31 fix
7 years ago
Xin Pan 36c2a9af27 pass builder allow cutomize pass in python.
7 years ago
dzhwinter 97636a9fcf "fix link error" (#13545)
7 years ago
chengduo cdf3a4c244 Fix concat_op InferShape (#13513)
7 years ago
Zeng Jinle f855c05fff Merge pull request #13520 from sneaxiy/enhance_eager_delete
7 years ago
Yu Yang 3043f51b3a Merge pull request #13511 from reyoung/fix_ce
7 years ago
Wu Yi 29c63d180f [Feature] dist op role and lr op role, to support memory optimize with dist training (#13220)
7 years ago
Yu Yang e1913bc5d8 Fix MixedVector
7 years ago
sneaxiy 0a36ef3c18 enhance eager deletion
7 years ago
Yu Yang 6d2c6f96f1 Revert "Revert "Merge pull request #13431 from chengduoZH/refine_lod""
7 years ago
Yu Yang a6c8d6b9a2 Revert "Merge pull request #13431 from chengduoZH/refine_lod"
7 years ago
chengduo d402234ba8 Feature/op_fuse_pass (#12440)
7 years ago
Xin Pan 23ba766252 Merge pull request #13475 from panyx0718/ir5
7 years ago
Xin Pan 759dae0493 Merge pull request #13461 from panyx0718/ir3
7 years ago
Xin Pan 0bd7a67eaf avoid creating dangling ir::Node.
7 years ago
Zeng Jinle 642cf6ca2f Merge pull request #13418 from sneaxiy/dam_save_memory
7 years ago
Yu Yang aa79bccfff Merge pull request #13460 from reyoung/fix_data_transform
7 years ago
Yu Yang 2c31ea9293 Merge pull request #13424 from chengduoZH/refine_seq_concat
7 years ago
sneaxiy fd3e32ea7d Merge develop
7 years ago
Xin Pan 5ce77889f4 clean unused inference_optimize c++ implementation
7 years ago
Yu Yang 922dee3b43 Wait input when data transform
7 years ago
chengduoZH 6534f8527a Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_lod
7 years ago
chengduoZH 24459501fe Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_seq_concat
7 years ago
Xin Pan a83a4fab5c Merge pull request #13441 from panyx0718/ir2
7 years ago
sneaxiy d9942cd122 Merge develop
7 years ago
chengduoZH cdb9605bad refine
7 years ago
sneaxiy 114eb17587 fix executor bug
7 years ago
Xin Pan e5b322051b clean
7 years ago
Xin Pan ec6ee0a293 simplify and hide bcast_params
7 years ago
yuyang c809fee3b0 Fix bug of Vector::Extend
7 years ago
chengduoZH def2a8b1b0 Async memcpy
7 years ago
chengduoZH e7940141ce refine seq_concat
7 years ago
sneaxiy 7116cc12fb Merge develop
7 years ago
sneaxiy abf9832c12 tiny change to save memory
7 years ago
sneaxiy 612e1a3155 modification
7 years ago
sneaxiy d0b2453ecd merge develop
7 years ago
sneaxiy 24ea39c4c6 feature/eager_delete_tensor
7 years ago
Tao Luo 9f03ce6c4a Merge pull request #13318 from Superjomn/refine/simple_fix
7 years ago
Yan Xu 23ec966cd3 Merge pull request #13360 from Yancey1989/parallel_bcast
7 years ago
Yancey1989 757f9683ab update comment text
7 years ago
Yancey1989 9ee1b7bc04 add some comments
7 years ago
Yancey1989 bad4ea192e update by comment
7 years ago
Yancey1989 1664899b63 update
7 years ago
Yancey1989 2888d2d7bc Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_bcast
7 years ago
Yancey1989 1e1b6622fd update by comment
7 years ago
Sylwester Fraczek dd149d469b hotfix for conv-relu pass
7 years ago
luotao1 b12322ce95 fix fusion_lstm unique_name bug
7 years ago
tensor-tang 62a9821043 Merge pull request #13346 from tensor-tang/refine/infershape
7 years ago
Qiyang Min 6abe03be0c Merge pull request #13317 from velconia/fix_python35_CI_random_fail
7 years ago
Sylwester Fraczek 41de582bb0 create conv relu pass for MKLDNN (#13258)
7 years ago
Jiabin Yang d46b575e0e Merge pull request #13349 from JiabinYang/mac/unittest
7 years ago
tensor-tang b0b5f515a9 Merge remote-tracking branch 'ups/develop' into refine/infershape
7 years ago
tensor-tang d61c11764a follow comment add enforce
7 years ago
Yancey1989 5ce1a960a5 move bcast op into pass
7 years ago
tensor-tang 8bb824bb93 refine infershape hasinput and hasoutput
7 years ago
JiabinYang 7dd54afd0c fix program desc unit test error
7 years ago
Yancey1989 23b12c6f58 fix invalide bcast in reduce strategy
7 years ago
tensor-tang e0436ad8bb refine fusion lstm infershape
7 years ago
guochaorong 76e9227467 Merge pull request #13199 from JiayiFeng/fix_CudnnHolder_bug
7 years ago
Xin Pan 17bf8713a5 Merge pull request #12988 from panyx0718/ir2
7 years ago
Yan Chunwei 2fd1bf2ea6 fea/add color log (#13305)
7 years ago
superjomn 6b2f680d88 simple fix
7 years ago
minqiyang dc863aac7e Add kids exists detection in Scope
7 years ago
Yan Chunwei 478a4e850e refactor ir pattern (#13304)
7 years ago
minqiyang 681514e15f Make all scope pointer to shared
7 years ago
Xin Pan e762d85de4 clean
7 years ago
Xin Pan 9b7c3f9615 refine
7 years ago
Xin Pan ff47eaf45f clean
7 years ago
Xin Pan 4313d870a2 refine
7 years ago
Xin Pan c69cf6dde8 fix
7 years ago
Xin Pan 56a977d436 add test
7 years ago
Xin Pan 926e1077ca version
7 years ago
Yang Yu 8331e835a8 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_CudnnHolder_bug
7 years ago
tensor-tang 3ea19b7596 fix bug and fc pass ut
7 years ago
tensor-tang df0c695618 fix fusion gru pass and enable it
7 years ago
tensor-tang c9bd2d50f1 refine fc and gru pattern
7 years ago
tensor-tang 353d7d1be6 Merge branch 'develop' into refine/ut/lac
7 years ago
Yan Chunwei e2d325ac08 refactor pass_library (#13261)
7 years ago
tensor-tang 0618077971 Merge remote-tracking branch 'ups/develop' into refine/ut/lac
7 years ago
tensor-tang 40dbd97f8e Merge remote-tracking branch 'ups/develop' into refine/op/peephole
7 years ago
tensor-tang 4d774953c6 enable fc gru fuse pass
7 years ago
tensor-tang 74f95b8da0 fix redefine macro
7 years ago
Tao Luo 7dd8adb5fd Merge pull request #13211 from luotao1/multi-thread2
7 years ago
tensor-tang f057077c3a add fuse fc gru pass
7 years ago
Yan Chunwei f143c9a916 port VisualDL's graphviz theme to IR (#13246)
7 years ago
tensor-tang 3eb55f0643 Merge remote-tracking branch 'ups/develop' into refine/op/peephole
7 years ago
luotao1 35cff5e00d Merge branch 'develop' into multi-thread2
7 years ago
Yan Chunwei 9df2d8b5ba test/add text-classification test (#13081)
7 years ago
tensor-tang 2f3b498949 refine fusion seq lstm peephole
7 years ago
luotao1 2dc23ffaa8 Merge branch 'develop' into multi-thread2
7 years ago
Xin Pan f94fdeaa0a Merge pull request #13215 from panyx0718/clean
7 years ago
Brian Liu 04272c0d41 Enable lstm peephole (#13160)
7 years ago
Xin Pan 5ae052bd14 remove some unused codes
7 years ago
luotao1 459d4cc811 Merge branch 'develop' into multi-thread2
7 years ago
fengjiayi db5e3dd767 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_CudnnHolder_bug
7 years ago
Tao Luo 907696709f Merge pull request #13133 from luotao1/library
7 years ago
Jiabin Yang d091dd02a0 fix mac compile error 0903 (#13184)
7 years ago
Yan Chunwei 2ef34c64c6 refine fc with pattern reusing (#13187)
7 years ago
Yan Chunwei 796c87d563 bugfix/fusion lstm (#13185)
7 years ago
Yu Yang cda7842e26 Revert "Revert "Add Python Callstacks when Op::Run error (#12759)""
7 years ago
fengjiayi 82a1b35b9b Revert "Revert "Add CudnnHolder and use it in Conv and ConvTranspose op""
7 years ago
guochaorong 151e169eb7 Revert "Add CudnnHolder and use it in Conv and ConvTranspose op"
7 years ago
luotao1 6f18217386 fix codestyle
7 years ago
luotao1 9790acbde6 Merge branch 'library' of https://github.com/luotao1/Paddle into luotao1-library
7 years ago
luotao1 d7b4965785 auto generate paddle_inference_pass.h
7 years ago
tensor-tang 1cc35f3642 Merge pull request #13118 from tensor-tang/optimize/op/fusion_lstm
7 years ago
luotao1 0a5d4601e3 Merge branch 'develop' into library
7 years ago
dzhwinter 4fa3cee549 Windows/hidden file (#13139)
7 years ago
luotao1 0639a32477 Merge branch 'develop' into library
7 years ago
fengjiayi 653c8ded7d Merge pull request #13078 from JiayiFeng/dev_CudnnHolder
7 years ago
luotao1 37d1a6685c Merge branch 'develop' into multi-thread2
7 years ago
tensor-tang 83f4bc4ecf follow comment and refine code
7 years ago
tensor-tang 9838bacb35 Merge branch 'develop' into optimize/op/fusion_lstm
7 years ago
Yan Chunwei 597b73053d refine/fc lstm fusion link (#13158)
7 years ago
Qiao Longfei 1835e91183 Merge pull request #13099 from jacquesqiao/fix-mac-compile
7 years ago
Jiabin Yang 6ba2b22279 Merge pull request #13096 from JiabinYang/fix_mac
7 years ago
luotao1 fb077c17e6 add shared library for analysis
7 years ago
dzhwinter ab1097cd8e Feature/template (#13093)
7 years ago
luotao1 0fbe0a7a28 add multi-thread ut for ditu-rnn
7 years ago
tensor-tang 80edd7ef29 enable run with fuse pass
7 years ago
fengjiayi f79ca23115 fix bugs
7 years ago
fengjiayi c501826f42 use framework::RWLock
7 years ago
Xin Pan c709a04ae2 Merge pull request #13018 from panyx0718/name
7 years ago
Yan Chunwei af15f6f038 fea/refine fuse (#13076)
7 years ago
Yu Yang 819af27d63 Merge pull request #13079 from reyoung/feature/fix_bug
7 years ago
qiaolongfei 583e21e60e fix mac compie
7 years ago
Wu Yi 0b8067c0dc fix dist train reduce mode (#13068)
7 years ago
yuyang18 555f03d8f0 Fix bug when loss@GRAD is reused.
7 years ago
Xin Pan 823c4f87be Merge pull request #13058 from panyx0718/infer
7 years ago
Jiabin Yang a9be969cf9 Update attention_lstm_fuse_pass.cc
7 years ago
JiabinYang 7c7d3d6172 Fix mac
7 years ago
Yan Chunwei cfa6bbb755 move nodeid from graph to node (#13065)
7 years ago
Xin Pan c558f059ad fix
7 years ago
Xin Pan 7dff0b11c6 fix
7 years ago
Xin Pan 0aa8135b11 Merge pull request #13045 from panyx0718/ir5
7 years ago
Xin Pan d1058003fe fix style
7 years ago
Xin Pan 51ef0ad766 allow to use name_scope for debugging and visiualization
7 years ago
Yan Chunwei 902f19b46a fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand (#13006)
7 years ago
Xin Pan c720ef972b fix graph_to_program_test to be platform-independent
7 years ago
Xin Pan ec5cb3762c Merge branch 'develop' into ir5
7 years ago
Xin Pan 2bb15f437c Merge pull request #12791 from panyx0718/ir3
7 years ago
Yu Yang 8965cee89f Polish PrintOp (#12895)
7 years ago
Wu Yi 0ee6fed05b Refine dist rpc deps (#12899)
7 years ago
Tao Luo 0b77518f3a Merge pull request #12982 from Sand3r-/mutable-data-check
7 years ago
Michal Gallus b6d261dff5 Enforce requested size of tensor to be sufficiently large
7 years ago
Xin Pan cf547e2714 fix program_desc feed/fetch names' order.
7 years ago
Xin Pan 08352fe56a fix
7 years ago
Xin Pan 6fdb7f5348 add test
7 years ago
Xin Pan 1a67061fee graph to program pass
7 years ago
qingqing01 1f09bc320c Support data type int8_t . (#12841)
7 years ago
dzhwinter 04b1e4dcea tensor module windows support (#12934)
7 years ago
dzhwinter c790d57cd4 data_type (#12933)
7 years ago
Qiao Longfei d0b713493e enhance DebugStringEx (#12949)
7 years ago
Tao Luo 3e3b5f4fda Merge pull request #12675 from Sand3r-/fix-conv-mkldnn-0.15
7 years ago
Xin Pan abeb71c895 small fix of op_desc
7 years ago
guochaorong 1f270275a6 Revert "Add Python Callstacks when Op::Run error (#12759)"
7 years ago
sneaxiy 5ea7bf88ba Merge pull request #12872 from sneaxiy/stack_op
7 years ago
Tao Luo ef2da86b4f Merge pull request #12885 from luotao1/test_ditu_rnn
7 years ago
luotao1 9c7fde45a7 enhance test_analyzer to profile ditu inference demo
7 years ago
Wu Yi c6f212a339 Merge pull request #12847 from typhoonzero/add_scope_info_in_graphviz
7 years ago
Wu Yi b8da70c370 Resovle multi gpu async deps (#12828)
7 years ago
Xin Pan 4a4567fc3d Merge pull request #12876 from panyx0718/ir5
7 years ago
Yu Yang b2df17003f Add Python Callstacks when Op::Run error (#12759)
7 years ago
Xin Pan 698c926ce5 copy program and fix op_desc
7 years ago
sneaxiy c73c5ed573 use for_range
7 years ago
Xin Pan b548ecbc2b add stack_op
7 years ago
Yan Chunwei 9ee698e605 enhance/ditu rnn with fc fuse (#12831)
7 years ago
typhoonzero dd7a79158b add scope info in graphviz debug
7 years ago
Yan Chunwei 896a37b6e3 fea/link ir to inference analysis and fc fuse support (#12789)
7 years ago
minqiyang beb93bb901 Fix ut bug for graph_test
7 years ago
Xin Pan daf464af68 Merge pull request #12807 from panyx0718/fix
7 years ago
dzhwinter 00463fdfe3 cudnn windows support (#12757)
7 years ago
Xin Pan 4a4c469f61 add test
7 years ago
Xin Pan 7473d5f735 fix program_desc constructor
7 years ago
Yu Yang c6af7201e9 Merge pull request #12692 from reyoung/feature/fast_executor
7 years ago
Xin Pan e525aa232e Merge pull request #12780 from panyx0718/ir4
7 years ago