Paddle

Commit Graph

Author	SHA1	Message	Date
Bai Yifan	e2c6bada36	Support dispensable student_loss in PaddleSlim distillation (#19824 ) * support_dispensable_student_loss, test=develop * add distillation test, test=develop * fix distillation test non convergence problem, test=develop * fix test_distillation fail problem, test=develop	6 years ago
Bai Yifan	aa63d5ac6d	Make the distillation process not save teacher variables in PaddleSlim (#19633 ) * split teacher checkpoints with student checkpoints, test=develop * add unittest for graph.merge(), test=develop	6 years ago
gongweibao	6c2bc29cc0	Fix float16 optimizer. (#19682 ) Fix float16 optimizer	6 years ago
Chen Weihang	73daa3d6c0	Code Cleanup: delete three useless raw variables in Conv2D (#19644 ) * delete useless raw variables in Conv2D, test=develop * adjust the vars number in test_graph_wrapper to pass unittest, test=develop	6 years ago
Jie Fang	c6a598a276	init new amp, optimize inserting cast op for batchnorm (#18596 ) init new amp, optimize inserting cast op for batchnorm	6 years ago
123malin	2f037c3189	fix the diff between async mode and async_half mode (#19535 ) * test=develop, communicator merge add => merge average	6 years ago
whs	1c2aae567a	Skip start epoch and end epoch when dumping strategy in PaddleSlim (#19580 ) test=develop	6 years ago
gongweibao	abaf87be2b	Change backward_guard to optimize_guard to maximize the allreduce overlap. (#19506 ) Change backward_guard to optimize_guard to maximize the allreduce overlap	6 years ago
Zeng Jinle	5dce1da680	remove reset recordio usage (#19519 )	6 years ago
lidanqing	ba368bf696	clean up intel labeled TODOs (#19476 ) test=develop	6 years ago
Leo Chen	6fb310ae29	Fix bug of getting bool Flags from os.environ (#19349 ) * fix bug of getting bool Flags from os.environ, test=develop * add empty loss_name in CompiledProgram for inplace grad test, test=develop	6 years ago
liu zhengxi	32598ffd8f	Python infer api update and add unit test (#19353 ) * python inference api supports numpy and add unit test, fix unit test fail in test_slim_int8_googlenet and test_slim_int8_mobilenet	6 years ago
Jiancheng Li	1799c257ad	Update Light-NAS to support latency-aware search (#19050 ) * update light_nas_strategy: add latency constraint test=develop * update light_nas_strategy: update get_model_latency test=develop * update light_nas_strategy: add more check test=develop * update light_nas test test=develop * update light_nas test test=develop * minor update light_nas test test=develop * minor update light_nas test test=develop * update light_nas test test=develop * update _constrain_func of light_nas_strategy test=develop * update _constrain_func of light_nas_strategy test=develop * remove unused code test=develop	6 years ago
Zhen Wang	0fe72469ea	Add the max-pool2d quantization support and the partial quantization support. (#19310 ) * add pool2d quantization support, only for max-pooling. * add the partial quantization support.	6 years ago
Tao Luo	2f8c7e021f	remove unused inference_transpiler unit-tests (#19130 ) * remove unused inference_transpiler unit-tests test=develop * remove InferenceTranspiler usage in quantize_transpiler.py test=develop	6 years ago
yaoxuefeng	9150cf50fc	add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871 ) * add ctr related metric layer test=develop * add save cache and slots shuffle test=develop * add save cache and slots shuffle test=develop * fix error * fix error * fix style for ci * fix for comments * change SlotsShuffle input to std::strinf for generality * fix style * fix style * fix style * fix style * fix style * fix style * fix stylr * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * change non-const reference to pointer * fix style * fix style * fix style test=develop * fix style test=develop * add return ins num in ctr metric op * change dtype to float in metric_op.py * fix error test=develop * fix style test=develop * fix API spec * fix API spec * fix API spec test=develop * add UT test=develop	6 years ago
chengduo	01c7daade7	Add checking for the fetch_list of Executor.run (#18957 ) * update exe.run	6 years ago
whs	c92b78b060	Fix unitest of light nas. (#18931 ) test=develop	6 years ago
tangwei12	d845848341	do some odd jobs (#18641 ) do some odd jobs, test=develop	6 years ago
guru4elephant	5ed713d519	remove ctr reader, all functions are satisfied in dataset (#18672 ) * remove ctr reader, all functions are satisfied in dataset	6 years ago
bingyanghuang	a25be53cb5	QAT int8 MKL-DNN transformation pass with MUL (#18322 )	6 years ago
Jie Fang	2b4ef509ea	init custom black white list (#18377 ) test=develop	6 years ago
Wojciech Uss	8ed819d8e4	Call the test_slim_int8_* tests through absolute path (#18386 ) test=develop	6 years ago
翟飞跃	19da59ed3f	Remove all the code, API and doc of MKL-DNN INT8v1 (#18347 )	6 years ago
whs	1bdfd2eb85	Fix checkpoint of Light-NAS (#18330 ) Socket can't be pickled. test=develop	6 years ago
Hongyu Liu	df2eee71d8	Sequence mask support tensor (#18249 ) * sequnce mask support max length tensor input; test=develop * add rnn_impl.py; test=develop * add basic gru lstm unittest; test=develop * fix api spec; test=develop * fix sequence_mask op bug; test=develop test=document_preview * change +-x to elmentwise_op; test=develop add mkl flag; test=develop * fix rnn impl bug; test=develop * update api spec; test=develop * fix doc bug; test=develop * fix lstm bugs; test=develop	6 years ago
Jie Fang	172c2facef	init black/white lists (#17847 ) test=develop	6 years ago
songhao	6b3d96254d	fix some bug when merge sparse embedding parameters, test=develop (#18223 ) 1. fix the bug that out_put_var in SaveSelectedRows would be empty string 2. use merge_sparse_lookup_table to replace sum op for load_persistables_for_inference 3. fix the bug in _clone_var_in_block_ when the var is SELECTED_ROWS.	6 years ago
Wojciech Uss	36b60e240a	Enable MKL-DNN for slim FP32 vs. INT8 tests (#18214 ) * Enable MKL-DNN for slim FP32 vs. INT8 tests test=develop * added test for MobileNetV1 with MKL-DNN test=develop	6 years ago
bingyanghuang	976cf46010	slim QAT MKL-DNN readme (#18187 ) * QAT doc first version * follow the comments, test=develop * update the benchmark, test=develop * Change to channel-wise, test=develop	6 years ago
chengduo	5588b923f3	Add multi process reader (#18115 ) * add multi process reader test=develop	6 years ago
翟飞跃	802ea50956	fix spelling errors (#17941 ) * fix spelling errors; test=develop * Update API.spec update md5 * Update API.spec * change the order of api;test=develop	6 years ago
翟飞跃	78441c5449	add mkldnn Int8v2 slim doc (#17909 )	6 years ago
翟飞跃	c2fb9b906a	change dirname mobilenet to mobilenetv1 (#18160 )	6 years ago
bingyanghuang	5cf7e7415d	update mkldnn int8v2 doc (#18177 )	6 years ago
Wojciech Uss	ca5642c850	unify FP32 vs. INT8 comparison tests output (#18111 ) test=develop	6 years ago
Sylwester Fraczek	accb132f0f	fix slim int8 mkldnn multithreading issue (#18009 )	6 years ago
chengduo	b5a1c1463d	Update CPU_NUM config (#18059 ) * update CPU_NUM config test=develop	6 years ago
Wojciech Uss	78e932862c	Added unit test for QAT FP32 & INT8 comparison (#17814 ) * added unit test for QAT FP32 & INT8 comparison test=develop * enabled other models and updated filenames test=develop * added accuracy check and multiple batch handling test=develop * removed quantization_mkldnn_pass.py test=develop * cleanup test=develop * updated model paths test=develop * renamed tests without MKL-DNN test=develop * fix reusing mkldnn pool2d primitive test=develop * add performance measuring test=develop * fix accuracy statistics test=develop * removed non-mkldnn tests test=develop * added conv2d_depthwise->conv2d mkldnn transformation test=develop * format update test=develop * fixed creating key for pool2d grad test=develop * added pass * Fix the accuracy issue while using float precision to get the scale. test=develop * Fix the format issue when 'X' is not nchw. test=develop * removed output comparing and changed number of images test=develop * cmake and comment fix test=develop * updated acc threshold for QAT comparison tests test=develop * added OMP_NUM_THREADS setting test=develop * enable all QAT INT8 tests test=develop * restored upstream version of a file test=develop * modified directory names test=develop	6 years ago
Huihuang Zheng	0bf2535158	Cherry-pick: fix random CI failure. (#18011 ) * Cherry-pick fix random Python3 CI failure. In some tests, SWEs used "print('xxx').format('xxx')". The syntax is only supported in Python2, not python3. However, since those lines are related to data download, if the CI machines already have the data, it passes CI tests. That causes random failure. * Cherry-pick: disable CUDNN case of test_warpctc_op Also temporary disable a unit test. The test will be fixed under high priority.	6 years ago
Kaipeng Deng	96ee528e3e	fix logging basicConfig cannot be setting after import paddle (#17786 ) * fix logging unable. test=develop * unset sys.stdout for stream handler. test=develop * fix newly add basicConfig. test=develop * fix import error. test=develop	6 years ago
whs	82bfa0a5ba	Enhence auto pruning and fix save/load graph when using py_reader 1. Enhence auto pruning. 2. Fix save/load graph when using py_reader. test=develop	6 years ago
bingyanghuang	90ebce9ead	QAT int8 MKL-DNN transformation pass (#17819 )	6 years ago
翟飞跃	209a3f4e09	add log into slim mkldnn int8 test (#17926 )	6 years ago
whs	5df65e506d	Add Ligth-NAS for PaddleSlim (#17679 ) * Add auto pruning strategy. 1. Fix compressor. 2. Enhence graph executor. 3. Add SAController 4. Add auto pruning strategy. 5. Add unitest for auto pruning strategy. test=develop * Init light-nas * Add light nas. * Some fix. test=develop * Fix sa controller. test=develop * Fix unitest of light nas. test=develop * Fix setup.py.in and API.spec. test=develop * Fix unitest. 1. Fix unitest on windows. 2. Fix package importing in tests directory. * 1. Remove unused comments. 2. Expose eval_epoch option. 3. Remove unused function in search_agent. 4. Expose max_client_num to yaml file. 5. Move flops constraint to on_epoch_begin function test=develop * Fix light nas strategy. test=develop * Make controller server stable. test=develop * 1. Add try exception to compressor. 2. Remove unitest of light-nas for windows. test=develop * Add comments Enhence controller test=develop * Fix comments. test=develop	6 years ago
翟飞跃	993c703bcc	INT8 MKL-DNN v2 integrate to slim (#17634 ) * refactor PR 16865 * delete mergetool files * test=develop * test=develop * test=develop * test=develop * create dir for int8 model before call SaveOptimModel * test=develop * mkldnn int8 only support linux; test=develop * refine code; test=develop * remove comment; test=develop * refine code; test=develop * fix bug; test=develop * add exception for mkldnn_post_training_strategy * reuse int8v2 CAPI dataset; test=develop * fix accuracy check bug; test=develop * remove tab * convert files to unix format * test=develop * reduce CI time;test=develop * reduce CI time and refine code;test=develop * refine comment; test=develop * add cmake FLAGS;test=develop * remove predict_num;test=develop	6 years ago
Yiqun Liu	8fd39f3e99	Enhance fused_elementwise_activation op and add python api in contrib.layers (#17236 ) * Enhance fused_elementwise_activation op. test=develop * Move the api fused_elementwise_activation to contrib. test=develop * Add including files. test=develop * Add the support of sigmoid in fused_elementwise_activetion op. * Update API.spec. test=develop	6 years ago
chengduo	9322216170	Add data distributed_sampler (#17573 ) * add data parallel batch	6 years ago
Zhen Wang	3398f99608	Adding AddQuantDequantPass for TensorRT int8 (#17529 ) * add quant_dequant_pass, test=develop * Add quant_dequant before some ops, such as the elementwise_add op. This is required by TensorRT. test=develop	6 years ago
Yibing Liu	f9796b1249	Add LAMB Optimizer support (#17489 ) * Add LAMB optimizer * Expose LAMB Optimizer's APIs test=develop, test=document_preview * Cleanup code & doc test=develop, test=document_preview * Update lamb optimizer's formula test=develop	6 years ago
Jie Fang	30e178fa2c	init auto loss scaling (#17194 ) * init auto loss scaling test=develop * change API.spec * change ifelse to switch and use reduce_sum to optimize checking isfinite test=develop * Remove redundant code test=develop	6 years ago
guomingz	6d1d7c8acf	Fix the KL algorithm bug when calculated the size of tensor. (#17198 ) * resolve #17184 Fix the KL algorithm bug when calculated the size of tensor. * Add the missing test=develop. test=develop	6 years ago
Zhen Wang	65541d83b0	add scale pass for calculating the output scales.test=develop (#17259 )	6 years ago
Zhen Wang	a40121e4c8	fix the initialization process error. test=develop (#17213 )	6 years ago
wopeizl	06325c4de1	disable the test_distillation_strategy temporarily (#17227 ) test_distillation_strategy always failed on a machine with 4 gpus only, disable temporarily and need to figure out the root cause and add it back later	6 years ago
guomingz	e4a52e082f	resolve #17159 (#17172 ) Update the folder name generation mechanism for saving the quantized model and weights. The folder name would be unique by adding the timestamp postfix. test=develop	6 years ago
Yibing Liu	beda78258f	Init mixed precision training interface (#16856 ) * Init mixed precision training interface * Add fp16 test script test=develop * All initializers support float16 test=develop * Code cleanup & add more code annotations test=develop * Update API spec test=develop * Add usage example in doc test=develop	6 years ago
Zhen Wang	b8c166f632	Merge pull request #17029 from wzzju/add_graph_checkpoint add checkpoint functions for graph. test=develop	6 years ago
Zhen Wang	bf6b9d6d37	add checkpoint functions for graph. test=develop	6 years ago
Wang, Chuanqi	45ebafc33e	Split test_calibration test to two tests (#16977 ) * Split test_calibration test to two tests test=develop * Modify CMakeLists file test=develop	6 years ago
whs	486f7d8ed6	Restore quantization and distillation stategy before loading persistables. (#16958 ) test=develop	6 years ago
Tao Luo	bc037c13c7	use multi-thread to speedup CI tests test=develop	6 years ago
chuanqiw	1818d1d9f0	Reduce test_calibration test time by using large batchsize test=develop	6 years ago
AIFollowers	fb1ae722d1	Fix model_stat importing in paddle/fluid/contrib	6 years ago
chengduo	bb80dae7d0	Add DecoupledWeightDecay (#16427 ) * Add DecoupledWeightDecay	6 years ago
whs	73c4f2b7b6	Fix distillation for soft label. (#16538 ) test=develop	6 years ago
Zhen Wang	4c1ec41de3	Merge pull request #16531 from wanghaoshuang/quan_ck [slim] Fix checkpoint of quantization strategy.	6 years ago
AIFollowers	e18ab78f67	add model_stat.py (#16512 ) * Add a tool to summary model's PARAMS, FLOPs in paddle/fluid/contrib.	6 years ago
wanghaoshuang	d41b623a72	Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into quan_ck test=develop	6 years ago
wanghaoshuang	6db7c2a500	Fix checkpoint of quantization.	6 years ago
Zhen Wang	f86429dbd9	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_slim_quant_bugs test=develop	6 years ago
whs	679a4c28fc	Fix lost of learning rate variable in distillatoin when using lr decay. (#16471 ) test=develop	6 years ago
Zhen Wang	6b854f3e1f	fix the save_in_nodes bug.	6 years ago
Zhen Wang	183bacebe3	clean codes and fix some bugs. test=develop	6 years ago
Tao Luo	1b4e4e7ef7	Merge pull request #16453 from chuanqi129/calibration_readme_refine Update INT8 calibration README	6 years ago
liuwei1031	8d22bc17a4	Memory optimize (#16410 ) * fix cdn issue, test=develop * fix memory optimize bugs, test=develop * fix memory optimize bugs, test=develop * remove add/sub_2 op, test=develop * disable memory_optimize by default, test=develop * disable inplace activation in python, test=develop * fix unittests, test=develop * fix unittests, test=develop * bug-fix, test=develop	6 years ago
chuanqiw	c512516ff4	Update INT8 calibration README test=develop	6 years ago
Zhen Wang	27d05203e7	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_quan_hang test=develop	6 years ago
Zhen Wang	1c11f817e9	Use the resolve hazard method.	6 years ago
Wang, Chuanqi	85e1cc1e02	Update Readme with new accuracy and performance data measured on 6271 (#16437 ) * Update Readme with new accuracy and performance data measured on 6271 test=develop * Modify tables and example command test=develop	6 years ago
whs	e9bec9369b	[slim] Add quantization strategy and distillation strategy. (#16408 ) * Add fsp operator. 1 Add unitest. 2. Add python API. 3. Add layer test. * Add quantization strategy. 1. Add API. 2. Add unitest. * Add distillatoin strategy. * Add unitest config file for quantization * Fix Copyright test=develop * Fix setup.py * Fix document of layers.py. test=develop * Fix unitest in python3. test=develop * Fix documents. test=develop * 1. refine fsp op by batched gemm 2. remove unused import test=develop * Fix test_dist_se_resnext. 1. disable test distillation. 2. reset framework.py test=develop * Enable unitest of distillation after fixing Block._clone_variable test=develop * Fix cdn issue. test=develop	6 years ago
Zhen Wang	2ccbfd5e10	Fix some bugs for quantization passes.	6 years ago
liuwei1031	de3b70a101	fix cdn issue, test=develop (#16423 ) * fix cdn issue, test=develop * fix cdn issue, test=develop	6 years ago
chuanqiw	431068c9ca	Enhance test calibration script on accuracy assert test=develop	6 years ago
whs	2e5831f0dc	[slim] Refine framework of slim and add filter pruning strategy (#16226 ) * First pr of paddle slim. 1. Add framework of paddle slim 2. Add filter pruning strategy test=develop * Rename unitest to tests. test=develop * Add prettytable into requirements. test=develop * Change in_nodes and out_nodes to odered dict. test=develop * Remove distillation. test=develop * Fix API.spec test=develop * Fix unitest. test=develop * Fix unitest in windows. test=develop * Fix unitest in windows. test=develop * Fix unitest. test=develop * Hide some functions. test=develop * Fix python import in python3.5 test=develop * Fix compress pass. test=develop * Fix unitest of test_dist_ctr. test=develop * Enhence flops. * use os.path.join * Fix pickle for python3 Fix log and comments. test=develop * 1. Remove feed_reader in compress pass 2. Fix cache reader 3. Rename CompressPass to Compressor 4. Add comments for distiller optimizer 5. Remove unused pruner currently 6. Add some comments. 7. Change API.spec test=develop * Fix pruning in python3. test=develop * Fix unitest in python3. test=develop * Fix format in python3. test=develop	6 years ago
Zhen Wang	ec11135d54	Merge pull request #16341 from wzzju/add_channel_wise_in_quant_pass Add channel wise in quant pass.	6 years ago
Zhen Wang	8965819fbb	rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop	6 years ago
Zhen Wang	ec88b6cc5a	add channel wise quantization in ir pass.	6 years ago
tangwei12	8ea4218ce1	update load persistables for increment, test=develop (#15576 ) * update load persistables for increment, test=develop * update load persistables for increment, test=develop * update API Spec, test=develop * update API Spec, test=develop * add doc, test=develop * add doc, test=develop * Update lookup_table_utils.py * Update API.spec * Update lookup_table_utils.py test=develop * Update API.spec test=develop * fix api spec * Update lookup_table_utils.py test=develop	6 years ago
achao2013	81b4fad8b9	add moving average absmax op and fix bug (#15155 ) * Add moving average absmax op in quantilize-aware training.	6 years ago
Zhen Wang	41b8cf0bae	Merge pull request #16162 from wzzju/fix_nan_static_quant Fix NaN bugs for static quantization strategy (mutil-cards train).	6 years ago
Zhen Wang	b8d1f5038e	Add the executor test for the graph clone API. test=develop	6 years ago
Zhen Wang	4ab0607634	fix NaN bugs.	6 years ago
Zhen Wang	ac6ef06ffa	Add the Clone method in Graph. test=develop	6 years ago
Zhen Wang	1b9c8d5f06	add clone function for IrGraph. test=develop	6 years ago
Zhen Wang	3cf0ee414d	update some details. test=develop	6 years ago
Zhen Wang	0cb50bb983	avoid ce fails on windows.	6 years ago
Zhen Wang	7c8f7df2fe	add some op_des funs to IrOpNode and add some var_des funs to IrVarNode. test=develop	6 years ago
Zhen Wang	33f99d6197	add IrNode&IrVarNode&IrOpNode. test=develop	6 years ago
Zhang, Guoming	e5d3d7c63d	resolve #15724 1.Remove the code for setting mkldnn environment in the test_calibration.py; 2.Update the cmake file for MKLDNN environment enabling; 3.Update the INT8 inference doc. test=develop	6 years ago
Zhen Wang	bc95a4ccfe	Merge branch 'develop' into quantization_inference_passes	6 years ago
sneaxiy	7e399b0628	rename test=develop	6 years ago
sneaxiy	42f6d0f899	modify API.spec test=develop	6 years ago
sneaxiy	f85245b409	test=develop	6 years ago
WangZhen	28dfad5e27	fix some bugs about python3. test=develop	6 years ago
WangZhen	2175292634	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantization_inference_passes test=develop	6 years ago
WangZhen	a7efab7ec1	add comments for public API. test=develop	6 years ago
Haihao Shen	b5ebca47a3	Add INT8 calibration README (#15548 ) * Add calibration README; test=develop	6 years ago
Haihao Shen	1b8047b712	Add INT8 calibration support in Paddle package (#15569 ) * Add INT8 calibration support in Paddle package; test=develop	6 years ago
Haihao Shen	294d594450	Enable performance measurement in INT8 calibration unit test (#15560 ) * Enable performance measurement in INT8 calibration unit test	6 years ago
WangZhen	0db41a9c44	add op_role attr when creating op node.	6 years ago
WangZhen	c67b29c178	fix some bugs of graph.to_program and get_pass.	6 years ago
Haihao Shen	f82515800c	Enable INT8 Calibration Unit Test for MobileNet-V1 (#15539 ) * Enable mobilenet UT in separate test class; use download cache by paddle download utility and cache unzip; and fix typo; test=develop * Extract cache_unzipping function for reuse; format code style; test=develop * Simplify the test code by define a combined function for both downloading and unzipping; test=develop	6 years ago
WangZhen	c64f22048a	add convert_to_int8 pass and transform_for_mobile pass and their UTs.	6 years ago
WangZhen	c8095eeb82	add freeze pass, and UT is passed.	6 years ago
Haihao Shen	2a82c5651e	Refine INT8 calibration API; shorten the iteration number to reduce test time; test=develop	6 years ago
乔龙飞 Qiao Longfei	c58555067e	Merge pull request #14731 from jacquesqiao/optimize-cpp-reader Optimize cpp reader	6 years ago
WangZhen	dde19a0ff8	add quantization freeze pass.	6 years ago
guomingz	dbdaf15ca0	[V1.3] Add the calibration tool code for int8 inference and focus test. (#15062 ) * Add the calibration tool code for int8 inference and focus test. * Fix the calibration tool per the review comments. test=develop * Update the calibrator doc and remove extra line. * Fix the invalid is_negative_input attr set on Mobilenet. * Add the comments and fix the format issue. test=develop * Update the CMakelist.txt for Calibration PR.Disable the Calibration UT if not enable MKLDNN. test=develop * Update the CMakeList.txt. test=develop * Disable the test_calibration case on WIN and MAC. test=develop * Add the missing brackets. test=develop * Remove the outdated map operator which not supported on Python3. test=develop * Fix the style issue. test=develop * 1.Update the CMakeList.txt to disable calibration tool ut when the WITH_MKL is not set; 2.Add the workaround to enable the FLAGS_use_mkldnn for PR_CI(PADDLE). test=develop * Fix the typo and format the License header. test=develop * 1.Add and Update TODOs per review comments. 2.Code clean. test=develop	6 years ago
Qiao Longfei	119a3d4deb	update comment test=develop	6 years ago
WangZhen	3b668c1574	Update some comments in the quantization transform pass. test=develop	6 years ago
WangZhen	b913463e83	Update according to the reviewers' suggestion. test=develop	6 years ago
WangZhen	3ce6172052	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization	6 years ago
WangZhen	59e5cc51d6	Add quantization transform pass and UT.	6 years ago
Qiao Longfei	cf0a057981	add document for ctr reader test=develop	6 years ago
Qiao Longfei	45578c1b48	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader	6 years ago
Dun	9f8f0fc2d3	Memory optimization of depthwise conv op and group norm op (#15313 ) * mem opt * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * refine code test=develop * refine code test=develop * refine code test=develop * refine code test=develop * refine with cub test=develop * fix mkldnn test && remove comments && test=develop * polish code && test=develop * add only_forward test && test=develop	6 years ago
WangZhen	e2ff300b02	add UT for quantization.	7 years ago
WangZhen	451896fce4	init quantization.	7 years ago
Qiao Longfei	edad60e612	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader	7 years ago
whs	938705745e	Init paddle slim (#14834 ) * Init slim. * Remove distillation demo. * Fix import errors. test=develop * Fix some issues. test=develop * Fix configs. test=develop * Modify API.spec. test=develop * Fix format. test=develop * Fix format. test=develop * Add some comments.	7 years ago
tangwei12	45dd3491db	lookup table utils fix (#14730 ) * fix utils config, test=develop * reweite tools in lookup table utils, test=develop * merge develop, test=develop * add copy in setup, test=develop * update api spec, test=develop * add doc, test=develop * prettifying doc, test=develop * Update API.spec update api.spec, test=develop * Update lookup_table_utils.py test=develop * Update lookup_table_utils.py test=develop * merge develop, test=develop * merge develop, test=develop * init fix, test=develop * fix in downloads, test=develop * fix in setup.in, test=develop	7 years ago
heqiaozhi	caa6b59677	add hdfs_utils & helper & node doc	7 years ago
heqiaozhi	7bd16e3afa	fix some bug & add log	7 years ago
heqiaozhi	5d3ecbfdf5	fix hdfs bug	7 years ago
heqiaozhi	009c7cf6cc	add finialize	7 years ago
heqiaozhi	57ac412b98	download data	7 years ago
Qiao Longfei	daba57f752	complete ctr_reader	7 years ago
Qiao Longfei	978fd6800c	update ctr_reader.py	7 years ago
Qiao Longfei	668ae9083e	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-ctr-reader	7 years ago
tangwei12	3639d99f99	Fix save and load lookup table/optimizer vars (#14301 ) * fix mkdir conflict * fix load/save lookup tables test=develop * add lookup_table_utils * fix load optimize vars on pserver * delete lookup table utils * fix save and load lookup tables * fix load optimizer var * fix load optimizer var, test=develop * fix python 3 style, test=develop * move lookup_table_utils to contrib utils	7 years ago
tangwei12	dd6fd4c747	Utils for download and upload files with HDFS (#14473 ) * add hdfs utils * add hdfs utils * test=develop * update hdfs utils and add demo * fix multi_download return local files * test=develop * add sync multi upload, test=develop	7 years ago
wopeizl	d9a1f3e58e	Windows/online (#14474 ) * add recordio support * disable the openblas multi-thread on windows since no support adjust the python script * code style * code style test=develop * add create_recordio_file_reader back * fix code style test=develop * fix the gtest.cmake on windows * fix cc_test on windows * fix the win build test=develop * remove fused compile support on windows test=develop * add the jit support test=develop * add the jit support, test=develop * add the jit support, test=develop * add the jit back fix compile error on windows * rollback test=develop * test case fix * disable DSO by default on windows * exclude warpctc_op on windows * exclude the dynload_warpctc out on windows test=develop * fix the scripts error test=develop * disable avx on windows by default test=develop * re-organize the cmake file * disable mkl on windows by default * add warp_ctc back * fix the dependency * fix the dependency * fix the build issue on windows * remove unsupported flag on windows * code style * code style test=develop * fix issue * add profiler, parallel_executor back * clean up the pre-definitions on windows * fix build issue * test=develop	7 years ago
peizhilin	1f12ba6192	gpu support, fix build issue: 1. Non utf-8 characters within comments of OPs may lead to protobuf fail to parse_from_string 2. comment out some ops which not supported on windows 3. cuda libs may not be correctly linked to target on windows	7 years ago
Qiao Longfei	aff54ef735	add ctr data	7 years ago
Dang Qingqing	f63ab5619b	Fix conflict.	7 years ago
Dang Qingqing	8f5d918a19	Disable one test in test_quantize_transpiler.	7 years ago
qingqing01	748be49e77	Fix random fail in Python3 (#13666 )	7 years ago
Dang Qingqing	209f799f91	test, test=develop	7 years ago
qingqing01	479ad4bb92	Merge branch 'develop' into quantize_transpiler_update	7 years ago

274 Commits (abb49df26fe32e2cd73d1c709480f58412c810ae)