Commit Graph

23241 Commits (a446d26e8ad805fd6c37a7f2a44b01fe28ffbd9e)
Author SHA1 Message Date
phlrain 0e40298949 fix matmul shape check; test=develop (6 years ago)
phlrain 56c2d384c7 add elementwise floordiv, mod; test=develop (6 years ago)
Wu Yi b7baeed7bb fix win gpu build test=develop (#16334) (6 years ago)
liuwei1031 df5d19aa9d temoprarily disable the code of use kCUDNN, test=develop (#16205) (6 years ago)
ruri 09e05a110b Merge pull request #16217 from ceci3/doc (6 years ago)
zhhsplendid 124f1df481 Add flags for init and re-alloc gpu (6 years ago)
Zhen Wang 8965819fbb rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop (6 years ago)
Wu Yi 8bebfe5640 add resnet nccl2 dist training, mp training unit test (#16167) (6 years ago)
Wojciech Uss cbe2dbf0db Add enabling quantization (#16326) (6 years ago)
lujun 09442fb27e checkpoint pr be moved here, test=develop (6 years ago)
Tao Luo 9a05859179 Merge pull request #16322 from wojtuss/wojtuss/fix_cpu_quantize_pass (6 years ago)
qingqing01 8caa785e83 Enhance affine_channel_op infer-shape check (#16317) (6 years ago)
flame 08838f3909 Fix save inference model bug (#16242) (6 years ago)
Kaipeng Deng 957ea995fc Merge pull request #16243 from heavengate/batch_norm_not_persistent (6 years ago)
nhzlx 4f4daa4b66 cherry-pick from feature/anakin-engine: add data type for zero copy #16313 (6 years ago)
nhzlx 07dcf2856c git cherry-pick from feature/anakin-engine: update anakin subgraph #16278 (6 years ago)
nhzlx c407dfa3cb cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276 (6 years ago)
nhzlx a25331bc26 cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189 (6 years ago)
nhzlx c79f06d3d8 cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178 (6 years ago)
nhzlx 69d37f81d7 cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157 (6 years ago)
nhzlx a1d200a5de cherry-pick from feature/anakin-engine: Anakin support facebox #16111 (6 years ago)
flame a32d420043 cherry-pick from feature/anakin-engine: batch norm (#16110) (6 years ago)
flame 0945b97f07 cherry-pick feature/anakin-engine: add anakin softmax/transpose/batch_norm/flatten/reshape op (#16020) (6 years ago)
nhzlx b21770a2aa cherry-pick from feature/anakin-engine: Add subgraph fuse support and anakin engine #16018 (6 years ago)
nhzlx 084310f536 paddle-anakin: concat, split, pool2d converter #16003 (6 years ago)
flame be523baad2 Add anakin conv2d/relu/sigmoid/tanh converter (#15997) (6 years ago)
Yan Chunwei d0ce6a9044 fix anakin converter registry (#15993) (6 years ago)
Tao Luo a5124ee0bb Merge pull request #16301 from luotao1/runtime_context_pass (6 years ago)
lujun 622fe6a56b checkpoint pr be moved here, test=develop (6 years ago)
baojun 2de263a5d9 Add softmax_with_cross_entropy_op to ngraph engine (#16304) (6 years ago)
sneaxiy bb166a1e10 fix API.spec (6 years ago)
ruri a3b8028d46 Merge pull request #16202 from shippingwang/add_sqrt_doc (6 years ago)
phlrain dd080b17c3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_sequence_pad_2 (6 years ago)
phlrain 1580be5d6c fix sequence pad; test=develop (6 years ago)
dengkaipeng aba2713ffc fix comment. test=develop (6 years ago)
chengduo f26ba5bddd Fuse AllReduce (#15921) (6 years ago)
Zeng Jinle d0ef682552 Merge pull request #16274 from sneaxiy/fix_grad_maker (6 years ago)
baojun 804afc51db Minor ngraph fix (#16270) (6 years ago)
Tao Luo 9195c3bb03 Merge pull request #16280 from luotao1/cos_sim_infershape (6 years ago)
Wojciech Uss 104a9f1e27 fix pattern maching conv2d with(out) ResidualData (6 years ago)
Wu Yi 6382b62f6b Collective ops (#15572) (6 years ago)
lujun bed0ecf3d2 checkpoint pr be moved here, test=develop (6 years ago)
lujun 5bb04ea47d Merge pull request #12 from PaddlePaddle/develop (6 years ago)
sneaxiy 023a3a3d62 fix op grad maker (6 years ago)
luotao1 82af8031d9 add runtime_context_cache_pass (6 years ago)
Zhen Wang ec88b6cc5a add channel wise quantization in ir pass. (6 years ago)
Tao Luo b9fc80a133 Merge pull request #16287 from PaddlePaddle/revert-16002-runtime_context (6 years ago)
whs 18911b6eea [enhence] Make step_input of dynamic_rnn support custom lod level. (#15972) (6 years ago)
zhhsplendid 22715487dc add allocator flags (6 years ago)
luotao1 c05af910bc refine cos_sim infershape (6 years ago)