Paddle

Commit Graph

Author	SHA1	Message	Date
lujun	1b6a2a09e8	fix mix input type error, test=develop	6 years ago
nhzlx	a1d11bb175	fix ci bug: cudnn handler in multi card test=develop	6 years ago
sneaxiy	2f54d9f995	Merge develop test=develop	6 years ago
sneaxiy	c75a880386	fix windows bug test=develop	6 years ago
sneaxiy	072d95d8f6	Merge develop test=develop	6 years ago
sneaxiy	a93a9eef8f	add op registry type refine gc code test=develop	6 years ago
whs	2e5831f0dc	[slim] Refine framework of slim and add filter pruning strategy (#16226 ) * First pr of paddle slim. 1. Add framework of paddle slim 2. Add filter pruning strategy test=develop * Rename unitest to tests. test=develop * Add prettytable into requirements. test=develop * Change in_nodes and out_nodes to odered dict. test=develop * Remove distillation. test=develop * Fix API.spec test=develop * Fix unitest. test=develop * Fix unitest in windows. test=develop * Fix unitest in windows. test=develop * Fix unitest. test=develop * Hide some functions. test=develop * Fix python import in python3.5 test=develop * Fix compress pass. test=develop * Fix unitest of test_dist_ctr. test=develop * Enhence flops. * use os.path.join * Fix pickle for python3 Fix log and comments. test=develop * 1. Remove feed_reader in compress pass 2. Fix cache reader 3. Rename CompressPass to Compressor 4. Add comments for distiller optimizer 5. Remove unused pruner currently 6. Add some comments. 7. Change API.spec test=develop * Fix pruning in python3. test=develop * Fix unitest in python3. test=develop * Fix format in python3. test=develop	6 years ago
whs	18779b5b8f	[Operator] Add range op. (#15431 ) * Add range op. test=develop * Add more unitests. test=develop * Fix API.spec test=develop * Fix API.spec test=develop * Fix API.spec test=develop	6 years ago
phlrain	7dc4a7f4f8	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_var_name_in_opt_2	6 years ago
Zhen Wang	ec11135d54	Merge pull request #16341 from wzzju/add_channel_wise_in_quant_pass Add channel wise in quant pass.	6 years ago
xiaolil1	e235882c18	Enable MKL-DNN INT8 Concat Kernel. (#16156 ) * Enable INT8 Concat Kernel to improve the performance of MobileNet-SSD. test=develop * Optimize UT format. test=develop * Fix UT file address issue. test=develop * Refine the license year. test=develop * Optimize code for new API. test=develop * Restructure INT8 Concat kernel. test=develop	6 years ago
Qiyang Min	171df5b56b	Merge pull request #16303 from junjun315/checkpoint for Checkpoint save and load	6 years ago
Hongyu Liu	e3bca9011c	Merge pull request #16357 from phlrain/fix_concat_check Fix concat check	6 years ago
Hongyu Liu	e5478ab5c8	Merge pull request #16346 from phlrain/add_floordiv_and_mod add elementwise floordiv, mod	6 years ago
chengduo	a6a3b2fbbc	[Speed]Refine ParallelExecutor (#16190 ) * refine parallelExecutor test=develop * Polish op_handle test=develop * Remove unnecessary op_handle test=develop * Fix Travis CI test=develop * Fix fetch bug test=develop * Remove WaitInputVarGenerated * Fix OpHandleBase::Run test=develop * debug test=develop * use origin fetch_op_handle test=develop * Revert op_handle_base.cc test=develop * Polish code test=develop * Fix OpHandleBase::Run test=develop * code refine * test CI and CE test=develop * fix OpHandle::Run test=develop * refine AllReduceOpHandle test=develop * Polish code test=develop	6 years ago
nhzlx	3df7b98a0f	Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD	6 years ago
nhzlx	f3a2e4b3d8	1. Add ANAKIN_ROOT compile option 2. refine trt code test=develop	6 years ago
phlrain	77a08750e9	add var name in optimizer; test=develop	6 years ago
chengduo	33965527fd	Add unit test for fuse all reduce (#16354 ) * refine fused_all_reduce_op * add unit test in test_parallel_executor_seresnext test=develop	6 years ago
Hongyu Liu	18a0f6d97a	Merge pull request #16351 from phlrain/fix_topk_shape_check Fix topk shape check	6 years ago
Hongyu Liu	15444430b0	Merge pull request #16348 from phlrain/fix_squeeze_check fix squeeze shape check	6 years ago
phlrain	5dc9b51994	fix time; test=develop	6 years ago
phlrain	686b8935fe	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_floordiv_and_mod	6 years ago
phlrain	18d107c27a	add floordiv and mod op; test=develop	6 years ago
phlrain	ff112813de	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_check	6 years ago
phlrain	8274d9d733	fix concat shape check; test=develop	6 years ago
Hongyu Liu	0d779f15f6	Merge pull request #16261 from phlrain/fix_sequence_pad_2 Fix sequence pad 2	6 years ago
Hongyu Liu	8c81d9949e	Merge pull request #16347 from phlrain/fix_matmul_check fix matmul shape check	6 years ago
phlrain	4e3b5e727d	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_check	6 years ago
sneaxiy	953214ad97	add more unittest modify allocator strategy remove changes of legacy buddy_allocator test=develop	6 years ago
Kaipeng Deng	5670530ca7	Revert "not use PERSISTENT in batch_norm. test=develop" (#16333 )	6 years ago
qingqing01	d2b938ef5a	Refine gradient proto maker and python API for affine_channel_op (#16340 ) * Rewrite gradient ProtoMaker for affine_channel_op to remove the Output as the input. * Add act in Python API to make the act can be in-place by layer_help.py	6 years ago
phlrain	b3d0336603	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_check	6 years ago
phlrain	249546bf45	fix concat shape check; test=develop	6 years ago
Tao Luo	294cdf6f48	Merge pull request #16177 from fc500110/remove_visualizer remove graph visualizer tool, which can be replaced by python IrGraph draw API	6 years ago
Hongyu Liu	b3b7e352ee	Merge pull request #16259 from phlrain/fix_lod_reset Fix lod reset	6 years ago
phlrain	3b46680913	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_squeeze_check	6 years ago
phlrain	6a5545aa22	fix squeeze shape check; test=develop	6 years ago
phlrain	d96f498bcf	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_topk_shape_check	6 years ago
chenjiawen	a859bfc31d	test=develop	6 years ago
phlrain	e7fb344c99	fix squeeze op shape check; test=develop	6 years ago
phlrain	190cfd692c	fix squeeze shape check; test=develop	6 years ago
chenjiawen	be6e78b7c4	test=develop;add approval RD check api	6 years ago
phlrain	0e40298949	fix matmul shape check; test=develop	6 years ago
phlrain	56c2d384c7	add elementwise floordiv, mod; test=develop	6 years ago
Wu Yi	b7baeed7bb	fix win gpu build test=develop (#16334 )	6 years ago
liuwei1031	df5d19aa9d	temoprarily disable the code of use kCUDNN, test=develop (#16205 ) * temoprarily disable the code of use kCUDNN, test=develop * add TODO comment, test=develop	6 years ago
ruri	09e05a110b	Merge pull request #16217 from ceci3/doc fix formula in dropout	6 years ago
zhhsplendid	124f1df481	Add flags for init and re-alloc gpu test=develop	6 years ago
Zhen Wang	8965819fbb	rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop	6 years ago
Wojciech Uss	cbe2dbf0db	Add enabling quantization (#16326 ) * Add enabling quantization test=develop * remove unused (here) function	6 years ago
lujun	09442fb27e	checkpoint pr be moved here, test=develop	6 years ago
Tao Luo	9a05859179	Merge pull request #16322 from wojtuss/wojtuss/fix_cpu_quantize_pass fix pattern maching conv2d with(out) ResidualData	6 years ago
qingqing01	8caa785e83	Enhance affine_channel_op infer-shape check (#16317 ) * Enhance affine_channel_op infer-shape check	6 years ago
flame	08838f3909	Fix save inference model bug (#16242 ) * save infer model bug fix, return target vars' name list	6 years ago
Kaipeng Deng	957ea995fc	Merge pull request #16243 from heavengate/batch_norm_not_persistent not use PERSISTENT in batch_norm. test=develop	6 years ago
nhzlx	4f4daa4b66	cherry-pick from feature/anakin-engine: add data type for zero copy #16313 1. refine anakin engine 2. add data type for zero copy align dev branch and PaddlePaddle:feature/anakin-engine brach the cudnn workspace modify was not included for now, because we use a hard code way in feature/anakin-engine branch. There should be a better way to implement it, and subsequent submissions will be made. test=develop	6 years ago
nhzlx	07dcf2856c	git cherry-pick from feature/anakin-engine: update anakin subgraph #16278	6 years ago
nhzlx	c407dfa3cb	cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276	6 years ago
nhzlx	a25331bc26	cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189	6 years ago
nhzlx	c79f06d3d8	cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178	6 years ago
nhzlx	69d37f81d7	cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157 support change input size	6 years ago
nhzlx	a1d200a5de	cherry-pick from feature/anakin-engine: Anakin support facebox #16111	6 years ago
flame	a32d420043	cherry-pick from feature/anakin-engine: batch norm (#16110 ) * use anakin batch norm and scale implement fluid batch norm	6 years ago
flame	0945b97f07	cherry-pick feature/anakin-engine: add anakin softmax/transpose/batch_norm/flatten/reshape op (#16020 ) * add anakin softmax/ flatten/reshape/transpose/batch_norm op converter	6 years ago
nhzlx	b21770a2aa	cherry-pick from feature/anakin-engine: Add subgraph fuse support and anakin engine #16018	6 years ago
nhzlx	084310f536	paddle-anakin: concat, split, pool2d converter#16003	6 years ago
flame	be523baad2	Add anakin conv2d/relu/sigmoid/tanh converter (#15997 ) * add activation op * test conv2d relu sigmoid tanh	6 years ago
Yan Chunwei	d0ce6a9044	fix anakin converter registry (#15993 )	6 years ago
Tao Luo	a5124ee0bb	Merge pull request #16301 from luotao1/runtime_context_pass add runtime_context_cache_pass	6 years ago
lujun	622fe6a56b	checkpoint pr be moved here, test=develop	6 years ago
baojun	2de263a5d9	Add softmax_with_cross_entropy_op to ngraph engine (#16304 ) * Add softmax_with_cross_entropy_op test=develop * simplify implementation test=develop	6 years ago
sneaxiy	bb166a1e10	fix API.spec test=develop	6 years ago
ruri	a3b8028d46	Merge pull request #16202 from shippingwang/add_sqrt_doc update sqrt explaination	6 years ago
phlrain	dd080b17c3	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_sequence_pad_2	6 years ago
phlrain	1580be5d6c	fix sequence pad; test=develop	6 years ago
dengkaipeng	aba2713ffc	fix comment. test=develop	6 years ago
chengduo	f26ba5bddd	Fuse AllReduce (#15921 ) * fuse all_reduce test=develop * add fuse_parameter_groups_size test=develop * Polish code test=develop * Fix travis-ci test=develop * Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize test=develop * Add SetGroupAccordingToMemorySize test=develop * fix multi_devices_graph test=develop * reset params_grads test=develop * Polish code test=develop	6 years ago
Zeng Jinle	d0ef682552	Merge pull request #16274 from sneaxiy/fix_grad_maker Remove unused variables in op grad maker	6 years ago
baojun	804afc51db	Minor ngraph fix (#16270 ) * take care edge cases test=develop * use pragma test=develop	6 years ago
Tao Luo	9195c3bb03	Merge pull request #16280 from luotao1/cos_sim_infershape refine cos_sim infershape	6 years ago
Wojciech Uss	104a9f1e27	fix pattern maching conv2d with(out) ResidualData test=develop	6 years ago
Wu Yi	6382b62f6b	Collective ops (#15572 ) * wip allreduce in op * wip * wip * wip * wip adding test * wip for conflict with mp mode * fix tests test=develop * fix cpu build test=develop * fix travis clang format test=develop * fix cpu build test=develop * update api.spec test=develop * delete comment test=develop * fix cpplint test=develop * fix test=develop * follow comment test=develop * add file test=develop * fix build test=develop * update test=develop * to be compatible with sync_bn, and fix mp mode in develop test=develop	6 years ago
lujun	bed0ecf3d2	checkpoint pr be moved here, test=develop	6 years ago
sneaxiy	023a3a3d62	fix op grad maker test=develop	6 years ago
luotao1	82af8031d9	add runtime_context_cache_pass test=develop	6 years ago
Zhen Wang	ec88b6cc5a	add channel wise quantization in ir pass.	6 years ago
Tao Luo	b9fc80a133	Merge pull request #16287 from PaddlePaddle/revert-16002-runtime_context Revert "cache runtime_context"	6 years ago
whs	18911b6eea	[enhence] Make step_input of dynamic_rnn support custom lod level. (#15972 ) * Make step_input support custom lod level. test=develop * Fix API.spec test=develop * Fix API.spec. test=develop * Fix API.spec test=develop * Add default value in document of step_input. test=develop * Fix document. test=develop * Fix API.spec test=develop	6 years ago
zhhsplendid	22715487dc	add allocator flags test=develop	6 years ago
luotao1	c05af910bc	refine cos_sim infershape test=develop	6 years ago
Hongyu Liu	d3acf68044	Merge pull request #16258 from phlrain/fix_concat_1 Fix concat	6 years ago
Tao Luo	7d2740db83	Revert "cache runtime_context"	6 years ago
sneaxiy	fd23262e0c	merge develop, fix conflict test=develop	6 years ago
ceci3	c79cdf258e	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into doc	6 years ago
ceci3	9d2ccec0f2	test=develop, fix doc	6 years ago
tensor-tang	ead558b7f6	Merge pull request #16256 from tensor-tang/refine/seqenum refine sequence enumerate op	6 years ago
ceci3	27f7a72641	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into doc	6 years ago
Qiyang Min	c7f1f3ed0c	Merge pull request #16214 from velconia/imperative_infer_var_type Implement imperative infer var type	6 years ago
Zeng Jinle	f8df9eb32e	fix api doc (#16201 ) test=develop	6 years ago
sneaxiy	3a09693f5c	change API name test=develop	6 years ago
Jacek Czaja	13816dd4ac	[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233 ) * - Fix to crash of Transformer when mkldnn is to be used Desc: TensorCopy was not setting MKLDNN primitive descriptor when layout was to be kMKLDNN test=develop * - Enable transformer for mkl-dnn test=develo * - Compilation fix test=develop * - Removed manual selection of MKL-DNN ops to be used in Transformer test test=develop	6 years ago
Yibing Liu	7e20e7691e	Fix the bug in fp16 backward kernel (#16269 ) test=develop	6 years ago
shippingwang	97c6051822	add api.spec, test=develop	6 years ago
Wojciech Uss	af03008890	Add cpu_quantize_placement_pass for C-API quantization (#16265 ) * Add cpu_quantize_placement_pass for C-API quantization test=develop * added a comment on required pass attributes test=develop	6 years ago
Tao Luo	dbb92ee4b1	Merge pull request #16002 from luotao1/runtime_context cache runtime_context	6 years ago
shippingwang	4f42504eef	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sqrt_doc	6 years ago
sneaxiy	e893cbd286	add auto increment best fit allocator test=develop	6 years ago
minqiyang	565b19b7a5	fix set data type bug test=develop	6 years ago
minqiyang	8364688c30	Fix py_func_op's problem	6 years ago
sneaxiy	161b8ddcaa	Merge develop	6 years ago
Zeng Jinle	6429d2a887	Merge pull request #16188 from sneaxiy/fix_const_cast Remove const_cast in optimizers	6 years ago
minqiyang	b40e41fbd1	Polish code style test=develop	6 years ago
Hongyu Liu	9874530ee8	Update lod_reset_op.cc	6 years ago
xiaolil1	e818fa1004	Enable INT8 transpose kernel for MobileNet-SSD improvement. (#16159 ) * Enable INT8 transpose kernel for MobileNet-SSD improvement. test=develop * Refine the license year. test=develop * Delete redundant code. test=develop * Add axis check. test=develop	6 years ago
Hongyu Liu	58c69a9eb9	Update lod_reset_op.cc	6 years ago
Xin Pan	374abcf361	Merge pull request #16247 from panyx0718/imperative add more imperative layer tests.	6 years ago
Hongyu Liu	96c54cd9b1	Update lod_reset_op.cc	6 years ago
Tao Luo	c072998ac1	Merge pull request #16219 from luotao1/fc_infershape refine fc_infershape	6 years ago
phlrain	e46a94a4a2	fix lod_rest; test=develop	6 years ago
phlrain	10ececb213	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_sequence_pad_2	6 years ago
phlrain	802b33489a	remove resize then seq num == 1; test=develop	6 years ago
phlrain	152beec540	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_lod_reset	6 years ago
tangwei12	8ea4218ce1	update load persistables for increment, test=develop (#15576 ) * update load persistables for increment, test=develop * update load persistables for increment, test=develop * update API Spec, test=develop * update API Spec, test=develop * add doc, test=develop * add doc, test=develop * Update lookup_table_utils.py * Update API.spec * Update lookup_table_utils.py test=develop * Update API.spec test=develop * fix api spec * Update lookup_table_utils.py test=develop	6 years ago
phlrain	a21fdde218	fix lod reset op;test=develop	6 years ago
phlrain	dcba2e7236	fix conncat; test=develop	6 years ago
phlrain	955fad7a90	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop	6 years ago
phlrain	a7fe3b508e	fix concat; test=develop	6 years ago
tensor-tang	50931dee1d	refine seq enum op test=develop	6 years ago
Qiyang Min	8e4ad008fb	Merge pull request #16198 from velconia/imperative_train_speed Improve imperative mode training speed	6 years ago
minqiyang	36dce65bb3	Take DataType and VarType apart test=develop	6 years ago
Xin Pan	3e9319f3ab	add more imperative layer tests. test=develop	6 years ago
luotao1	d9f0e7252a	refine with comments test=develop	6 years ago
luotao1	6fa52f83ba	Merge branch 'develop' into fc_infershape	6 years ago
luotao1	cc0ae1f1a1	refine with comments test=develop	6 years ago
dengkaipeng	9d435973d2	not use PERSISTENT in batch_norm. test=develop	6 years ago
luotao1	a275fd6e0c	Merge branch 'develop' into runtime_context	6 years ago
Wojciech Uss	2579ade45f	Add cpu_quantize_pass for C-API quantization (#16127 ) * Add cpu_quantize_pass for C-API quantization test=develop * add cpu_quantize_pass test * fix lint: add include memory unorderd_map and unordered_set test=develop * fuse_relu 1 test=develop * tuned 2 without squash * fixes test=develop * remove unused vars test=develop * refactored test=develop * fix lint c-style cast -> C++ style cast test=develop * remove QuantMax and c style casts test=develop * last usage of QuantMax removed test=develop * Fix Analysis Predictor UT Check if memory_optimize_pass has already been added to the analysis config before adding a new one, so that it is not added multiple times. test=develop * change map to unordered_map fix the forgotten part of cpu_quantize_pass_tester.cc test=develop * removed quantized attribute * fixed cpu_quantize_pass_tester and op attr comments test=develop * removed redundant line test=debug * removed gmock test=develop * fix after merge	6 years ago
Xin Pan	7458114b5b	Merge pull request #16228 from panyx0718/imperative graph neural network for imperative mode	6 years ago
chengduo	efca4de78e	Fix cross_entropy bug (#16236 ) test=develop	6 years ago
qingqing01	86e912c544	Fix windows compiling (#16230 ) test=develop	6 years ago
sneaxiy	4b073c95dc	fix compiler test=develop	6 years ago
Tao Luo	38898c2808	Merge pull request #16212 from Aurelius84/develop improve layers.fc api doc	6 years ago
minqiyang	db0c970823	Polish code test=develop	6 years ago
Kaipeng Deng	b77ebb2af2	Merge pull request #15919 from heavengate/yolo_box add yolo_box for detection box calc in YOLOv3	6 years ago
minqiyang	362253732c	Polish code test=develop	6 years ago
minqiyang	c0ddb93ccc	Polish code test=develop	6 years ago
minqiyang	b5078c211a	Make infer var type virtual test=develop	6 years ago
minqiyang	9041b238e3	Polish code test=develop	6 years ago
minqiyang	438bca9c3d	Implement Runtime Var Type Inference test=develop	6 years ago
Xin Pan	50ff898378	graph neural network for imperative mode test=develop	6 years ago
luotao1	5ecdc49c6b	set enable_runtime_context_cache_ default false test=develop	6 years ago
ceci3	cd906fc603	change api.spec	6 years ago
Zhaolong Xing	c49e604906	Merge pull request #16213 from qingqing01/compile_infer_shape Skip compile infer shape in box_coder_op	6 years ago
achao2013	81b4fad8b9	add moving average absmax op and fix bug (#15155 ) * Add moving average absmax op in quantilize-aware training.	6 years ago
luotao1	721c2c00ef	refine fc_infershape test=develop	6 years ago
Kaipeng Deng	74037cc1c8	Merge branch 'develop' into yolo_box	6 years ago
Xin Pan	92b9ce3479	Merge pull request #16073 from heavengate/yolov3_loss_imporve Yolov3 loss: add mixup score and label smooth	6 years ago
luotao1	46ee6bb1aa	fix distributed unit-tests test=develop	6 years ago
luotao1	1b59bed989	Merge branch 'develop' into runtime_context	6 years ago
Aurelius84	2d1e76fb0c	fix API.spec test=develop	6 years ago
luotao1	6ce25c99a0	Merge branch 'develop' into runtime_context	6 years ago
Aurelius84	6cfd20dea8	fix words spell error test=develop	6 years ago
qingqing01	8ad672a287	Support sync batch norm. (#16121 ) * Support Sync Batch Norm. * Note, do not enable it in one device. Usage: build_strategy = fluid.BuildStrategy() build_strategy.sync_batch_norm = True binary = fluid.compiler.CompiledProgram(tp).with_data_parallel( loss_name=loss_mean.name, build_strategy=build_strategy)	6 years ago
shippingwang	98d9552f0f	update sqrt explaination, test=develop	6 years ago
minqiyang	ca392c7e97	Implement infer var type context	6 years ago
Yibing Liu	4ae23cc3c5	Impl fp16 compute kernel for slice_op (#16206 ) * Impl fp16 compute kernel for slice_op test=develop * Use data() to replace mutable_data()	6 years ago
sneaxiy	f0d108f589	fix const_cast test=develop	6 years ago
Dang Qingqing	e5e7628a62	Skip compile infer shape in box_coder_op test=develop	6 years ago
Aurelius84	a59b7d47a8	improve layers.fc api doc test=develop	6 years ago
sneaxiy	3e03695629	fix numeric error test=develop	6 years ago
sneaxiy	5a92e4c097	revert revert 16144 test=develop	6 years ago
sneaxiy	e993effb29	include unordered_map to cross_entropy_op.cc test=develop	6 years ago
Zeng Jinle	a91964c8fe	Revert "PaddingRNN model memory optimize" test=develop	6 years ago
liuwei1031	1c6caf8466	1. disable reuse SELECTED_ROWS type variable (#16150 ) 2. remove lod check in reshape op test=develop	6 years ago
Wojciech Uss	b9252f3df8	Add cpu_quantize_squash_pass for C-API quantization (#16128 ) * Add cpu_quantize_squash_pass for C-API quantization test=develop * add cpu_quantize_squash_pass teste * fix lint: add include memory unorderd_map and unordered_set test=develop * lint fix 2 * fixes test=develop * refactored test=develop * fix windows ci test=develop	6 years ago
minqiyang	f83739499c	Polish code test=develop	6 years ago
minqiyang	7355d41834	1. Add imperative gperf profiler 2. Add binutils 2.27 in manylinux support test=develop	6 years ago
Zeng Jinle	0b49e43d3a	Merge pull request #16144 from sneaxiy/rnn_mem_opt PaddingRNN model memory optimize	6 years ago
luotao1	b2898c0f57	Merge branch 'develop' into runtime_context test=develop	6 years ago
minqiyang	98dfb492bb	Release GIL lock	6 years ago
sneaxiy	ac0e0f5181	merge develop test=develop	6 years ago
sneaxiy	a7a4f053da	Merge develop test=develop	6 years ago
Tao Luo	4ef6f738c3	Merge pull request #16154 from luotao1/infershape_example add all_kernels_must_compute_runtime_shape example for speedup infershape	6 years ago
tianshuo78520a	f404d53ba5	Api approvals (#16179 )	6 years ago
minqiyang	42e96a029f	Accelerate CPU part	6 years ago
sneaxiy	487624e15d	fix travis-ci test=develop	6 years ago
sneaxiy	0279020ba6	Merge develop test=develop	6 years ago
luotao1	1510b866b6	turn off runtime_context_cache for tensorrt test=develop	6 years ago
guomingz	decdbed054	resolve #15618 (#16114 ) * resolve #15618 Backgroud: the PR #15398 raised the box_coder op performance regression, we optimized the code via the more efficency leveraging opemmp.	6 years ago
sneaxiy	1e9fd40777	combine op files test=develop	6 years ago
sneaxiy	682f2dbf29	merge develop test=develop	6 years ago
sneaxiy	2c4fcaa683	merge develop	6 years ago
luotao1	d94fd97230	add runtime_context_cache_pass test=develop	6 years ago
Kaipeng Deng	1a4a90a81d	Merge pull request #16140 from tink2123/arc_function Add the inverse trigonometric function	6 years ago
Yan Xu	30568473ec	fix broadcast on mp mode (#15951 ) * fix broadcast with mp mode * polish code test=develop * fix bcast strategy test=develop * fic cpplint test=develop * fix py3 failed test=develop * fix comment test=develop * update comment test=develop	6 years ago
baojun	e3c37bd564	remove const_cast and refactor ngraph engine code (#15925 ) * remove concast_cast and refactor code test=develop * reduce flag use test=develop	6 years ago
fc500110	1c6e72b905	remove visualizer, which can be replaced by python IrGraph draw API	6 years ago
chengduo	0979956619	Add memory profiler (#16137 ) test=develop	6 years ago
luotao1	b561ad1e55	Merge branch 'develop' into runtime_context	6 years ago

... 2 3 4 5 6 ...

14510 Commits (98dda08a8535d6faa1442dd6452d7ce5d035712d)