Paddle

Commit Graph

Author	SHA1	Message	Date
Leo Zhao	54636a1982	call SetNumThreads everytime to avoid missing omp thread setting (#17224 ) * call SetNumThreads everytime to avoid missing omp thread setting resolve #17153 test=develop * add paddle_num_threads into config for test_analyzer_pyramid_dnn resolve #17153 test=develop	6 years ago
wopeizl	83c4f7721f	use two GPUs to run the exclusive test test=develop (#17187 )	6 years ago
luotao1	490e746269	fix runtime_context_cache bug when gpu model has an op runs only on cpu test=develop	6 years ago
wopeizl	d9991dccdd	add parallel build script to ci … (#16901 ) * add parallel build script to ci test=develop * 1. classify the test case as single card/two cards/multiple cards type 2. run test case according to the run type	6 years ago
Tao Luo	aa7b975bf6	disable runtime_context_cache pass by default test=develop	6 years ago
Tao Luo	ca8b8fa0bd	Merge pull request #16830 from Superjomn/fix/tmp-memory-optim fix memory optim temporarily	6 years ago
lijianshe02	de26df440b	add SaveOptimModel interface in analysis_predictor.h and test it in a… (#16441 ) * add SaveOptimModel interface in analysis_predictor.h and test it in analyzer_dam_tester and analyzer_resnet50_tester test=develop	6 years ago
superjomn	f58c3ec189	fix memory optim temporarily test=develop	6 years ago
liuwei1031	85363848a1	Security issue (#16774 ) * disable memory_optimize and inpalce strategy by default, test=develop * fix security issue http://newicafe.baidu.com:80/issue/PaddleSec-3/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-8/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-12/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-32/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-35/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-37/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-40/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-43/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-44/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-45/show?from=page test=develop * revert piece.cc, test=develop * adjust api.cc,test=develop	6 years ago
tensor-tang	d6c1b5a73b	disable seqpool concat pass by default saving CI time test=develop	6 years ago
luotao1	226596a296	Merge branch 'develop' into core_opt_choose_kernel	6 years ago
luotao1	bd636a9ea6	test_analyzer_int8 tests use default pass order test=develop	6 years ago
Yan Chunwei	044ae2497d	fix identity temporarily (#15942 )	6 years ago
Wojciech Uss	ec2750b3c2	fix repeating passes (#16606 )	6 years ago
Wojciech Uss	9b6a029666	fix dataset reading and add support for full dataset (#16559 )	6 years ago
石晓伟	5dea0bdd1b	Merge pull request #16498 from Shixiaowei02/feature/anakin-engine merge feature/anakin-engine to develop	6 years ago
Shixiaowei02	bddb2cd315	resolve conflicts with the develop branch test=develop	6 years ago
nhzlx	d065b5bf2b	Anakin ssd support refine trt first run add quant dequant fuse pass omit simplify_anakin_priorbox_detection template omit transpose_flatten_concat_fuse template test=develop	6 years ago
Michał Gallus	2d8b7b3a76	Refine default MKL-DNN Pass order (#16490 ) * Refine default MKL-DNN Pass order test=develop * Add comment to default MKL-DNN Pass list test=develop	6 years ago
Wojciech Uss	09dfc7a2aa	C-API quantization core 2 (#16396 ) * C-API quantization core test=develop Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com> * Decouple Quantizer from AnalysisPredictor test=develop * fixes after review test=develop * renamed mkldnn quantize stuff test=develop * remove ifdef from header file test=develop	6 years ago
nhzlx	953bdde058	Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD test=develop	6 years ago
nhzlx	45b3766fdf	fix comments test=develop	6 years ago
liuwei1031	de3b70a101	fix cdn issue, test=develop (#16423 ) * fix cdn issue, test=develop * fix cdn issue, test=develop	6 years ago
nhzlx	3df7b98a0f	Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD	6 years ago
nhzlx	f3a2e4b3d8	1. Add ANAKIN_ROOT compile option 2. refine trt code test=develop	6 years ago
luotao1	056599a738	add expected_kernel_cache_pass test=develop	6 years ago
Wojciech Uss	cbe2dbf0db	Add enabling quantization (#16326 ) * Add enabling quantization test=develop * remove unused (here) function	6 years ago
nhzlx	4f4daa4b66	cherry-pick from feature/anakin-engine: add data type for zero copy #16313 1. refine anakin engine 2. add data type for zero copy align dev branch and PaddlePaddle:feature/anakin-engine brach the cudnn workspace modify was not included for now, because we use a hard code way in feature/anakin-engine branch. There should be a better way to implement it, and subsequent submissions will be made. test=develop	6 years ago
nhzlx	07dcf2856c	git cherry-pick from feature/anakin-engine: update anakin subgraph #16278	6 years ago
nhzlx	c407dfa3cb	cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276	6 years ago
nhzlx	a25331bc26	cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189	6 years ago
nhzlx	c79f06d3d8	cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178	6 years ago
nhzlx	69d37f81d7	cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157 support change input size	6 years ago
nhzlx	a1d200a5de	cherry-pick from feature/anakin-engine: Anakin support facebox #16111	6 years ago
nhzlx	b21770a2aa	cherry-pick from feature/anakin-engine: Add subgraph fuse support and anakin engine #16018	6 years ago
luotao1	82af8031d9	add runtime_context_cache_pass test=develop	6 years ago
Tao Luo	7d2740db83	Revert "cache runtime_context"	6 years ago
luotao1	a275fd6e0c	Merge branch 'develop' into runtime_context	6 years ago
Wojciech Uss	2579ade45f	Add cpu_quantize_pass for C-API quantization (#16127 ) * Add cpu_quantize_pass for C-API quantization test=develop * add cpu_quantize_pass test * fix lint: add include memory unorderd_map and unordered_set test=develop * fuse_relu 1 test=develop * tuned 2 without squash * fixes test=develop * remove unused vars test=develop * refactored test=develop * fix lint c-style cast -> C++ style cast test=develop * remove QuantMax and c style casts test=develop * last usage of QuantMax removed test=develop * Fix Analysis Predictor UT Check if memory_optimize_pass has already been added to the analysis config before adding a new one, so that it is not added multiple times. test=develop * change map to unordered_map fix the forgotten part of cpu_quantize_pass_tester.cc test=develop * removed quantized attribute * fixed cpu_quantize_pass_tester and op attr comments test=develop * removed redundant line test=debug * removed gmock test=develop * fix after merge	6 years ago
luotao1	5ecdc49c6b	set enable_runtime_context_cache_ default false test=develop	6 years ago
luotao1	1510b866b6	turn off runtime_context_cache for tensorrt test=develop	6 years ago
luotao1	d94fd97230	add runtime_context_cache_pass test=develop	6 years ago
luotao1	1283833395	zero_copy tensor support INT32 test=develop	6 years ago
luotao1	31c4e1d9fc	Merge branch 'develop' into zero_copy	6 years ago
Tao Luo	e5e7e9b865	Merge branch 'develop' into transformer_ut	6 years ago
Tao Luo	6f2581e4c5	Merge pull request #16090 from lidanqing-intel/paddle-int32 Add PaddleDType INT32 support	6 years ago
Zhaolong Xing	3d63aa0a11	Merge pull request #15729 from NHZlX/add_static_model_load_for_trt Four points for enhancing Paddle-TRT	6 years ago
nhzlx	a9ed427749	cant not pass ci add if use static engine for trt test=develop	6 years ago
luotao1	fad06cb928	unify ZeroCopy in analysis_test	6 years ago
lidanqing	4aeb261da9	Add INT32 support. INT32 in last switch case test=develop	6 years ago
luotao1	06aab1b493	refine SetCpuMathLibraryNumThreads test=develop	6 years ago
nhzlx	3c40cb767b	7 refine zero copy update trt in docker file test=develop	6 years ago
Yiqun Liu	1616c32acf	Add the include of cudnn.h to enable the use of CUDNN_VERSION. (#15961 ) test=develop	6 years ago
nhzlx	2eff3e26b6	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_static_model_load_for_trt	6 years ago
nhzlx	06a088a199	fix comments and fix cpplint test=develop	6 years ago
nhzlx	0ed63b2108	6. delete useless predictor id test=develop	6 years ago
Sylwester Fraczek	1943119fc5	fix typo memeroy->memory test=develop	6 years ago
Sylwester Fraczek	8bc604571f	fix typo seriazlized->serialized	6 years ago
Sylwester Fraczek	543e53db05	fix typo releated->related	6 years ago
tensor-tang	e1c707fe9c	fix warnings (#15790 ) * fix warnings test=develop * fix enforce test test=develop	6 years ago
nhzlx	2070fb246d	4. do the trt_engine optim during init. add simple static mode loading test=develop	6 years ago
Yan Chunwei	3a5d6e5e64	move passes to src to avoid different behavior in deployment (#15705 )	6 years ago
Yan Chunwei	c00ed19df2	add more comment (#15603 )	6 years ago
Gabor Buella	da9c94da33	Clang build fixes (#15628 ) * Remove some superfluous std::move calls The std:move triggered a build error (with -Werror): ``` [ 9%] Building CXX object paddle/fluid/memory/allocation/CMakeFiles/allocator_facade.dir/allocator_facade.cc.o /home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: error: moving a temporary object prevents copy elision [-Werror,-Wpessimizing-move] [this] { return std::move(CreateAllocatorWithChunk()); }, capacity); ^ /home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: note: remove std::move call here [this] { return std::move(CreateAllocatorWithChunk()); }, capacity); ^~~~~~~~~~ ~ 1 error generated. ``` See: https://reviews.llvm.org/D7633 * Remove a superfluous lambda capture from framework/operator.h ``` [ 10%] Building CXX object paddle/fluid/platform/CMakeFiles/device_context.dir/init.cc.o In file included from /home/tej/code/gbuella_paddle/paddle/fluid/platform/init.cc:19: /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.h:229:21: error: lambda capture 'this' is not used [-Werror,-Wunused-lambda-capture] [this](Variable* var) { return var; }); ^~~~ 1 error generated. ``` Changing it to `return it->second;`, as is in the function below. * Rethrow an exception (instead of copying it) ``` [ 11%] Building CXX object paddle/fluid/framework/CMakeFiles/operator.dir/operator.cc.o /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: error: local variable 'exception' will be copied despite being thrown by name [-Werror,-Wreturn-std-move] throw exception; ^~~~~~~~~ /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: note: call 'std::move' explicitly to avoid copying throw exception; ^~~~~~~~~ std::move(exception) ``` See https://reviews.llvm.org/D43322 for an explanation of this diagnostic message. * Remove an unused variable ``` /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:884:16: error: private field 'scope_' is not used [-Werror,-Wunused-private-field] const Scope& scope_; ^ ``` * struct ComputationOpHandle -> class ComputationOpHandle ``` [ 13%] Building CXX object paddle/fluid/framework/details/CMakeFiles/memory_early_delete_pass.dir/memory_early_delete_pass.cc.o In file included from /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/memory_early_delete_pass.cc:21: /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: error: class 'ComputationOpHandle' was previously declared as a struct; this is valid, but may result in linker errors under the Microsoft C++ ABI [-Werror,-Wmismatched-tags] class ComputationOpHandle; ^ /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/computation_op_handle.h:29:8: note: previous use is here struct ComputationOpHandle : public OpHandleBase { ^ /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: note: did you mean struct here? class ComputationOpHandle; ^~~~~ struct 1 error generated. ``` * Fix name() methods under fluid/operators ``` In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.cc:15: In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.h:19: /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/jitcode.h:71:23: error: 'name' overrides a member function but is not marked 'override' [-Werror,-Winconsistent-missing-override] virtual const char* name() const = 0; ^ /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen_base.h:31:23: note: overridden virtual function is here virtual const char* name() const = 0; ^ ``` test=develop	6 years ago
Chunwei	d85c2e4e5c	fix anakin compile dependency test=develop	6 years ago
qingqing01	943d972878	Fix analysis predictor when loading the persistable RAW type variable. (#15613 )	6 years ago
Yan Chunwei	e887d71958	fix ir debug config (#15571 )	6 years ago
Yan Chunwei	897789b16e	fix save_inferece_model bug (#15365 )	6 years ago
Tao Luo	3d0ecab41b	add analyzer_transformer_test test=develop	6 years ago
Yan Chunwei	655179089f	AnalysisConfig remove contrib namespace (#15540 )	6 years ago
qingqing01	a6910f900e	Always create variables in analysis_predictor before OptimizeInferenceProgram. (#15533 ) Otherwise, some other persistable variable (like RAW type) will not be created	6 years ago
Yan Chunwei	b62b756b28	add version support (#15469 )	6 years ago
Yan Chunwei	526790e652	infer get program (#15511 )	6 years ago
Zhaolong Xing	97b76c94c4	Merge pull request #15242 from NHZlX/trt_int8_ultimate_version add trt int8 support	6 years ago
Yan Chunwei	e2818c8608	add dynamic memory optim (#15457 )	6 years ago
nhzlx	92cf4a4c6b	fix comments test=develop	6 years ago
nhzlx	027d24c831	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version	6 years ago
nhzlx	9641324995	fix comments test=develop	6 years ago
nhzlx	484b3bc801	When cudnn version < 7100, there is problem with conv_fusion. Add check for it. test=develop	6 years ago
flame	d60751fb71	add python inference api (#15248 ) add python inference api	6 years ago
Yan Chunwei	885c4e57ab	fea/infer memory optim2 (#14953 )	6 years ago
Yan Chunwei	c9e5aa19c1	get tensor API add more comments (#15345 )	7 years ago
Yan Chunwei	e84234b551	make clone thread safe (#15363 )	7 years ago
Zhaolong Xing	236201c222	Merge pull request #15350 from NHZlX/fix_bug_for_precditor fix analysis config bug	7 years ago
Yan Chunwei	e07900d317	cache tensor ptr in ZeroCopyTensor (#15352 )	7 years ago
Yan Chunwei	b7916440ff	hot fix the Native clone (#15344 )	7 years ago
nhzlx	b95f2ff8fe	fix win build bug test=develop	7 years ago
nhzlx	b938324381	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version test=develop	7 years ago
nhzlx	312fe0ece1	add trt int8 calibration support fix comments test=develop	7 years ago
Yiqun Liu	568cc2ffa8	Optimize while_op for test (#14764 ) * Simplify the compare op for CPU. * Use asynchronous tensor copy in reshape_op's kernel. * Optimize while_op for test, avoiding creating variables every time. test=develop * Enable the cache of kernel type and kernel function. test=develop * Enable profiling with gperftools. * Remove flags for testing, and fix the linking error. test=develop * Delete the codes of ChooseKernel. test=develop * Fix bug when preparing ExecutorPrepareContext for while_op. * Fix missing depending on grpc libraries. * Remove the redundant print. test=develop * Follow comments. * Remove the codes related to prepare the ExecutorPrepareContext for while_op. test=develop	7 years ago
nhzlx	b2ba3471fd	fix analysis config bug.	7 years ago
tensor-tang	a5d2a6d1ad	add fuse pass of sequared mat sub fusion	7 years ago
tensor-tang	a89296ac1f	add repeated fc relu pass	7 years ago
Zhaolong Xing	98e85f3735	add_transpose_flatten_concat_fuse (#15121 )	7 years ago
wopeizl	5d9edb4124	Merge pull request #15156 from wopeizl/windows/fixgpuissue fix gpu buils issue on windows test=develop	7 years ago
tensor-tang	146e942c65	Merge pull request #15250 from tensor-tang/refine/seqpool/feed Refine/seqpool/feed with infer zerocopytensor	7 years ago
peizhilin	439691f5bd	adjust the shlwapi on windows test=develop	7 years ago
tensor-tang	ce909664d8	Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed	7 years ago
peizhilin	e239558e56	remove the dismatch enclosure to avoid warning message test=develop	7 years ago
Tao Luo	2b11c710b3	Merge pull request #15249 from NHZlX/fix_trt_demo_ci fix demo ci bug	7 years ago
tensor-tang	137060135e	fix zerocopy size	7 years ago
nhzlx	e7d83389e6	fix demo ci bug 1. trt_demo bug 2. trigger exit when exists a bug test=develop	7 years ago
nhzlx	4e3522e5b4	add trt int8 support test=develop	7 years ago
tensor-tang	72d2a1801e	add seqpool concat fuse pass test=develop	7 years ago
Yan Chunwei	d09d6eadc0	make inference api work with Doxygen (#15195 )	7 years ago
Yan Chunwei	875a07c32d	refactor inference analysis api (#14634 )	7 years ago
tensor-tang	516fe301ee	add comment in case of empty name test=develop	7 years ago
tensor-tang	dca68cdf97	throw error when name not find test=develop	7 years ago
tensor-tang	cd94df8679	fix load and refine	7 years ago
Zhaolong Xing	4048cfa9da	Merge pull request #15048 from NHZlX/add_affine_channel_fuse Add conv+ affine channel fuse pass	7 years ago
Zeng Jinle	c0bcff00dc	Merge pull request #14962 from sneaxiy/rewrite_variable_type Rewrite variable type	7 years ago
Tao Luo	05f1b65da3	simplify prepere_input in analyzer_test test=develop	7 years ago
nhzlx	02e17396c2	fix comments test=develop	7 years ago
nhzlx	71636e677d	add min_subgraph_size attr to tensorrt config test=develop	7 years ago
sneaxiy	dde3afe7b7	Merge develop test=develop	7 years ago
nhzlx	73b47df1f4	Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_affine_channel_fuse test=develop	7 years ago
nhzlx	ce3782c193	add affine_channel fuse. fix conv+elemenwise fuse bug.	7 years ago
qingqing01	51a9fca323	Async memory copy (#15013 )	7 years ago
sneaxiy	ae6f46a1a9	rewrite variable type test=develop	7 years ago
peizhilin	07c7eaabb4	Merge remote-tracking branch 'upstream/develop' into windows/mkl test=develop	7 years ago
Zhaolong Xing	a9fb34fad8	Merge pull request #14903 from NHZlX/add_conv_elementwise_pass Add conv + elementwiseAdd pass	7 years ago
peizhilin	5a6d7fe2ff	add mkl,ctc support for windows	7 years ago
wopeizl	0f085f0a5a	Merge pull request #14892 from wopeizl/windows/port3 fix script issue	7 years ago
nhzlx	fcc93d96d5	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_conv_elementwise_pass fix conflicts test=develop	7 years ago
Yu Yang	bacf1d2399	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type	7 years ago
nhzlx	4e4a777243	add conv+elementwiseadd pass test=develop	7 years ago
Yan Chunwei	a985949be9	Fea/fuse conv elementwise add fuse (#14669 )	7 years ago
Yu Yang	04a570b463	Fix ut test=develop	7 years ago
peizhilin	23dec78772	fix script issue test=develop	7 years ago
Yu Yang	9bd70a1e04	Change tensor uses proto::VarType::type test=develop	7 years ago
bingyanghuang	943ad4781f	One possible solution to add flexibility for mkldnn placement pass (#14768 ) * Choose to turn on use_mkldnn attribute v1 * Fix mkldnn_op empty bug * format change test=develop * fix ci test=develop * fix ci test and add test in dam test=develop * add example to dam compare test test=develop * review changes test=develop	7 years ago
Yihua Xu	3821fc3950	Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl test=develop	7 years ago
Tao Luo	743cb840f1	update with comments test=develop	7 years ago
Tao Luo	42359e88a4	clean code test=develop	7 years ago
Tao Luo	405b2486db	support loading from memory test=develop	7 years ago
Xin Pan	7e0801d4ed	Merge pull request #14441 from baojun-nervana/intel/ngraph_op Implementing ngraph engine	7 years ago
Yihua Xu	64e261c6cd	Implement the fusion of convolution and bias for mkldnn (test=develop)	7 years ago
Yan Chunwei	4b7617740e	fix container not cleared (#14231 )	7 years ago
nhzlx	49c28b8c52	Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass test=develop	7 years ago
Sang Ik Lee	24e70920db	Refactor some build settings. test=develop	7 years ago
Sang Ik Lee	d6125a5eec	Include ngraph in inference demo build. test=develop	7 years ago
Tao Luo	b4de023ee1	Merge pull request #14636 from Superjomn/fix/word2vec fix word2vec bug	7 years ago
nhzlx	d3e140a572	Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass test=develop	7 years ago
nhzlx	900fbb83f9	add params sync pass	7 years ago
superjomn	9c665c81ae	update test=develop	7 years ago
minqiyang	a02ce58f2c	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog test=develop	7 years ago
Yiqun Liu	726f2cefe3	Fix bug of referencing a temporary variable. (#14614 ) test=develop	7 years ago
minqiyang	be04d99fe4	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog test=develop	7 years ago
minqiyang	53433d7f2e	Revert the changes of VLOG test=develop	7 years ago
peizhilin	36cd18b549	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
qingqing01	39ec80def4	Remove the memory copy of feeding data in C++ inference API (#14577 ) * Remove the memory copy for feeding data in C++ inference API * Fix compling dependence * Fix compling in ONLY_CPU mode	7 years ago
peizhilin	1afa9492af	Recover the profiler	7 years ago
Yiqun Liu	bf222f197d	Use sub scope in tensor_array_to_tensor op. (#14524 ) test=develop	7 years ago
dzhwinter	840c1b29ad	test=develop (#14562 ) * test=develop remove code. * test=develop	7 years ago
luotao1	116979a40a	refine api name test=develop	7 years ago
luotao1	a5c4b463c9	add SetMKLDNNThreadId api	7 years ago
luotao1	e21edb26f6	add Set/GetCPUNumThreads api	7 years ago
peizhilin	7c8c9dc9bf	fix unit test cases	7 years ago
wopeizl	d9a1f3e58e	Windows/online (#14474 ) * add recordio support * disable the openblas multi-thread on windows since no support adjust the python script * code style * code style test=develop * add create_recordio_file_reader back * fix code style test=develop * fix the gtest.cmake on windows * fix cc_test on windows * fix the win build test=develop * remove fused compile support on windows test=develop * add the jit support test=develop * add the jit support, test=develop * add the jit support, test=develop * add the jit back fix compile error on windows * rollback test=develop * test case fix * disable DSO by default on windows * exclude warpctc_op on windows * exclude the dynload_warpctc out on windows test=develop * fix the scripts error test=develop * disable avx on windows by default test=develop * re-organize the cmake file * disable mkl on windows by default * add warp_ctc back * fix the dependency * fix the dependency * fix the build issue on windows * remove unsupported flag on windows * code style * code style test=develop * fix issue * add profiler, parallel_executor back * clean up the pre-definitions on windows * fix build issue * test=develop	7 years ago
peizhilin	6e66fadb95	clean up the pre-definitions on windows	7 years ago
nhzlx	a4dc1d4292	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_trt test=develop	7 years ago
nhzlx	faeb9b8aa9	fix compile rely problem	7 years ago
nhzlx	b742d46520	fix demo ci bug on trt	7 years ago
hjchen2	a8c077df7c	Implement leaky relu tensorRT converter	7 years ago
Superjomn	e878a8e885	update test=develop	7 years ago
superjomn	4bf6817cbc	fix gpu load model the parameters will load from CPUPlace, that will keep copying data between CPU and GPU places. test=develop	7 years ago
Zhaolong Xing	2f27c048cc	Merge pull request #14440 from hjchen2/develop Add PRelu tensorRT plugin and Conv2d transpose op converter	7 years ago
hjchen2	21f33b4274	Complete PRelu plugin and Conv2d transpose op converter	7 years ago
Sylwester Fraczek	8a1eeec579	add mkldnn prop_kind phase for inference-only case to pooling and activations (#14278 ) * add is_test to pooling and activations add prop_kind support for layers activation. conv and pooling add a pass that sets is_test to true add transpiler version of is_test pass test=develop * patch test and pass test=develop * add pass to analyzer.h test=develop * add is_test attr description & pass only on mkldnn in: activation_op.cc batch_norm_op.cc conv_op.cc dropout_op.cc lrn_op.cc pool_op.cc sequence_pool_op.cc softmax_op.cc * fix is_test handling for activation pool and conv * change description of is_test for all layers again * remove GetAttr(use_mkldnn) from pass * rename correct_mkldnn_test_phase to is_test and remove dependency on MKLDNN test=develop * review fix magic number * two if(..)s into one * Check is_test once and pass mkldnn forward prop kind * dereference shared_ptr with * (without get()) test=develop * add is_test_pass back test=develop	7 years ago
dzhwinter	d3aed98d86	Merge pull request #14320 from wopeizl/windows/online Windows/online	7 years ago
Yiqun Liu	9e6b1c5f97	Refine tester of TensorRT engine (#14390 ) * Refine the tester for MixedRTPredictor. test=develop * Enable the profiler in TensorRT engine. * Support the use of combined inference model in TensorRT unittest, and print the shape of feed targets.	7 years ago
peizhilin	1a9008c420	code style fix test=develop	7 years ago
nhzlx	ddb120357c	Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_trt_plugin merge develop and fix conflicts	7 years ago
peizhilin	30ddc07a7e	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
Yan Chunwei	9f252e0032	Combine Inference Analysis with IR (#13914 )	7 years ago
nhzlx	d38fd6a0fc	add plugin support and offer an simple split sample	7 years ago
peizhilin	ca60e1d34d	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
peizhilin	52f7644f53	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
Qiyang Min	698698f2fa	Merge branch 'develop' into fix_vlog	7 years ago
qingqing01	abe209234f	Exhaustive search for cuDNN conv. (#14286 ) * exhaustive search for cuDNN conv. * Refine code and add unit testing. * Fix model load in fluid/inference and unit testing in conv2d * Follow comments. * Fix compiling test=develop	7 years ago
minqiyang	87450b9ad4	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog test=develop	7 years ago
peizhilin	4ffa92d4f0	Merge branch 'develop' into windows/build	7 years ago
Tao Luo	813e54efbd	Merge pull request #14328 from PaddlePaddle/revert-14046-windows/debug Revert "cherry picked windows patches."	7 years ago
minqiyang	3db9fad764	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog test=develop	7 years ago
minqiyang	3da43dcae2	Because anakin do NOT use glog, so we revert anakin related change test=develop	7 years ago
Tao Luo	387610aae1	Merge pull request #14325 from luotao1/fix_test_analysis_predictor fix test_analysis_predictor	7 years ago
peizhilin	45125ba538	fix share library issue	7 years ago
Zhaolong Xing	ba8b5619a3	Revert "cherry picked windows patches."	7 years ago
minqiyang	fcc0452c8b	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog test=develop	7 years ago
Tao Luo	381bea0a16	fix test_analysis_predictor test=develop	7 years ago
minqiyang	0c3227a523	Change the origin VLOG level to 10 times Fix code to support cpplint syntax check test=develop	7 years ago
peizhilin	869487a2b7	Merge remote-tracking branch 'origin/develop' into windows/build	7 years ago
dzhwinter	2835e04409	merge develop branch. test=develop	7 years ago
qingqing01	db8c52da5e	Revert " Exhaustive search for cuDNN conv. (#14043 )" This reverts commit `ce7d9b0799`.	7 years ago
qingqing01	ce7d9b0799	Exhaustive search for cuDNN conv. (#14043 ) * exhaustive search for cuDNN conv. * Refine code and add unit testing. * Clean code * Fix model load in fluid/inference and unit testing in conv2d * Follow comments.	7 years ago
peizhilin	9d67c1fb69	cpu build support	7 years ago
dzhwinter	60f70b174d	test=develop	7 years ago
dzhwinter	cc02353d10	test=develop	7 years ago
dzhwinter	eb2f7ed21b	refine tests. test=develop	7 years ago
Tao Luo	fe8f178582	fix word2vec related inference unit-tests (#14203 )	7 years ago

... 2 3 4 5 6 ...

539 Commits (ad6e3dd69cd915dd61287e96de7ec4ae132d24a5)