Zeng Jinle
25ab23be28
Fix dygraph mem leak ( #18082 )
...
* fix dygraph mem leak, test=develop
* polish msg, test=develop
6 years ago
Zeng Jinle
6eec66a1b1
Fix py_reader iterable bug ( #18108 )
...
* fix py_reader iterable bug, test=develop
* move data from buffered_reader,test=develop
6 years ago
qingqing01
80d2e66f9e
Update backward appending strategy to support double backward and fix some bugs. ( #18104 )
...
* Update backward.py:
- If there is no input grad var in all outputs of previous ops, do not append this op into graph.
- Only apply this strategy for double backward.
* Update some double backward op.
* Update sum_op to judge whether a tensor is empty by numel or IsInitialized().
6 years ago
Wojciech Uss
ca5642c850
unify FP32 vs. INT8 comparison tests output ( #18111 )
...
test=develop
6 years ago
Wojciech Uss
c26130f3a9
reuse C-API INT8 unit test application ( #18077 )
...
* reuse C-API INT8 unit test application
test=develop
* updates after review
test=develop
6 years ago
FlyingQianMM
ff83655f7e
add detection output operator for supporting retinanet ( #17896 )
...
* test=develop
add detection output for supporting retinanet
* test=develop
add test_layers.py
* test=develop
add API.spec
* test=develop
alter test_retinanet_detection_output.py
* test=develop
alter round 2
* test=develop
alter retinanet_detection_output
* test=develop
alter paddle/fluid/API.spec
* test=develop
alter detection.py
* test=develop
alter retinanet_detection_output
* test=develop
alter paddle/fluid/API.spec
* test=develop
alter detection.py
* test=develop
alter API.spec
* test=develop
alter retinanet_detection_output
* test=develop
alter paddle/fluid/API.spec
* test=develop
alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py
* test=develop
alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py
* test=develop
fix grammar error
* test=develop
fix grammar error
* test=develop
fix grammar error
* test=develop
alter python/paddle/fluid/tests/unittests/test_layers.py
* test=develop
alter paddle/fluid/API.spec
6 years ago
FlyingQianMM
0aee1f0074
add sigmoid focal loss operator for supporting retinanet ( #17895 )
...
* test=develop
add sigmoid_focal_loss for supporting retinanet
* test=develop
add test_layers
* test=develop
add API.spec
* test=develop
alter sigmoid_focal_loss_op.cc
* test=develop
alter detection.py
* test=develop
alter API.spec
* test=develop
alter round 1
* test=develop
alter sigmoid_focal_loss
* test=develop
alter sigmoid_focal_loss_op.cc
* test=develop
alter test_layers.py
* test=develop
alter paddle/fluid/API.spec
* test=develop
alter sigmoid_focal_loss_op.cu
* test=develop
alter paddle/fluid/operators/detection/sigmoid_focal_loss_op.cc
6 years ago
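
The log for #17895 names the sigmoid focal loss operator but does not show what it computes. As a rough illustration only, the widely used focal loss for a single binary prediction can be sketched as below. The function name, the scalar interface, and the defaults `gamma=2.0` and `alpha=0.25` are assumptions made for this sketch; it is not the operator's kernel and omits any normalization the operator may apply.

```python
import math


def sigmoid_focal_loss(logit, target, gamma=2.0, alpha=0.25):
    """Focal loss for one binary prediction (hypothetical helper, not Paddle's kernel)."""
    # Probability of the positive class.
    p = 1.0 / (1.0 + math.exp(-logit))
    if target == 1:
        # Positive example: the (1 - p)^gamma factor shrinks the loss when p is already high.
        return -alpha * (1.0 - p) ** gamma * math.log(p)
    # Negative example: the p^gamma factor shrinks the loss when p is already low.
    return -(1.0 - alpha) * p ** gamma * math.log(1.0 - p)
```

With `gamma=0` and `alpha=0.5` this reduces to half the ordinary binary cross-entropy, which is a convenient sanity check.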
FDInSky
9e4b9d9798
Update generate_proposal_labels_op to support CascadeRCNN. ( #17200 )
...
* Update generate_proposal_labels_op to support CascadeRCNN.
6 years ago
FlyingQianMM
9ed2f936f1
add target assign operator for supporting retinanet ( #17893 )
...
* test=develop add target assign for retinanet
* test=develop
run ci
* test=develop
add test_layers
* test=develop
add API.spec
* test=develop
alter round 1
* test=develop
alter rpn_target_assign_op.cc
* test=develop
alter test_rpn_target_assign_op.py
* test=develop
alter rpn_target_assign_op.cc
* test=develop
alter API.spec
* test=develop
alter paddle/fluid/operators/detection/rpn_target_assign_op.cc
* test=develop
alter rpn_target_assign_op.cc
* test=develop
alter python/paddle/fluid/layers/detection.py
* test=develop
alter paddle/fluid/API.spec
6 years ago
Sylwester Fraczek
accb132f0f
fix slim int8 mkldnn multithreading issue ( #18009 )
6 years ago
chengduo
24e988a471
Fix bug of scope_buffered_ssa_graph_executor ( #18100 )
...
* fix code bug
test=develop
6 years ago
Huihuang Zheng
3f55ab0f89
Modify format of GPU allocation failure log. ( #18034 )
...
As title
test=develop
6 years ago
gongweibao
f5caf3443c
Fix reinitialized ncclid error! ( #18025 )
6 years ago
whs
354643d8d9
Add warning for cudnn warpctc kernel in CUDA9\CUDA10. ( #18046 )
...
test=develop
6 years ago
qingqing01
e81756f1ba
Hide paddle.fluid.layers.detection_map. ( #18033 )
...
* Remove layers.detection_map API
* Users can use fluid.metrics.DetectionMAP to calculate both current-batch and cumulative mAP, whereas layers.detection_map can only calculate current-batch mAP.
6 years ago
Yiqun Liu
660c1a65f3
Optimize fused_elewise_activation_grad op. ( #18041 )
...
test=develop
6 years ago
lidanqing
466254151a
add MobileNet-SSD int8 analyzer tester ( #18075 )
...
* add pascalvoc preprocess script and mobilenet-ssd analyzer_tester, wait 17737
* change converting local dataset to downloading and converting tarfile
test=develop
* change the test data_path
test=develop
* change copyright (c) 2016 to copyright (c) 2019
test=develop
6 years ago
石晓伟
42f12a4aca
fix ci test cmake test=develop ( #18060 )
6 years ago
chengduo
b5a1c1463d
Update CPU_NUM config ( #18059 )
...
* update CPU_NUM config
test=develop
6 years ago
lidanqing
f8ecc3de89
refactor the function ConvFwdPrimitiveDesc ( #17897 )
...
* refactor the function ConvFwdPrimitiveDesc
test=develop
* change according to review
test=develop
* use pointer way without boost::optional
test=develop
* pass vector to function by reference instead of raw vector
test=develop
* change pointer to shared_ptr
test=develop
6 years ago
Michał Gallus
8462e2b805
Disable MKLDNN FC in Resnet50 test ( #18030 )
6 years ago
Wojciech Uss
78e932862c
Added unit test for QAT FP32 & INT8 comparison ( #17814 )
...
* added unit test for QAT FP32 & INT8 comparison
test=develop
* enabled other models and updated filenames
test=develop
* added accuracy check and multiple batch handling
test=develop
* removed quantization_mkldnn_pass.py
test=develop
* cleanup
test=develop
* updated model paths
test=develop
* renamed tests without MKL-DNN
test=develop
* fix reusing mkldnn pool2d primitive
test=develop
* add performance measuring
test=develop
* fix accuracy statistics
test=develop
* removed non-mkldnn tests
test=develop
* added conv2d_depthwise->conv2d mkldnn transformation
test=develop
* format update
test=develop
* fixed creating key for pool2d grad
test=develop
* added pass
* Fix the accuracy issue while using float precision to get the scale.
test=develop
* Fix the format issue when 'X' is not nchw.
test=develop
* removed output comparing and changed number of images
test=develop
* cmake and comment fix
test=develop
* updated acc threshold for QAT comparison tests
test=develop
* added OMP_NUM_THREADS setting
test=develop
* enable all QAT INT8 tests
test=develop
* restored upstream version of a file
test=develop
* modified directory names
test=develop
6 years ago
tensor-tang
566bf2ec56
concat op support negative axis ( #18045 )
...
test=develop
6 years ago
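
For readers unfamiliar with what "negative axis" support means in #18045: a negative axis counts from the last dimension, so it is usually normalized by adding the tensor rank. The sketch below shows that idea on 2-D nested lists; `normalize_axis` and `concat_2d` are hypothetical names for this illustration and do not reflect the operator's actual code.

```python
def normalize_axis(axis, rank):
    """Map an axis in [-rank, rank) to its non-negative equivalent."""
    if not -rank <= axis < rank:
        raise ValueError(f"axis {axis} is out of range for rank {rank}")
    return axis + rank if axis < 0 else axis


def concat_2d(a, b, axis):
    """Concatenate two 2-D nested lists along the given (possibly negative) axis."""
    axis = normalize_axis(axis, 2)
    if axis == 0:
        # Stack the rows of b under the rows of a.
        return a + b
    # axis == 1: extend each row of a with the matching row of b.
    return [row_a + row_b for row_a, row_b in zip(a, b)]
```

Here `concat_2d(a, b, -1)` behaves like `axis=1` and `concat_2d(a, b, -2)` like `axis=0`.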
Yiqun Liu
7e463c84a6
Optimize the concat and split cuda implementation for cases when the number of inputs/outputs is less than 5. ( #17979 )
...
test=develop
6 years ago
tangwei12
101f74cb19
fix save/load in fleet ( #17675 )
...
* fix save/load in Fleet
* add UT framework of Fleet
6 years ago
hutuxian
f1d458daf0
add trainer_desc proto DEPS ( #18019 )
6 years ago
Guo Sheng
a06b316b94
Fix GetExpectedKernelType of add_position_encoding_op ( #17935 )
...
* Fix the GetExpectedKernelType of add_position_encoding_op.
test=develop
* Fix the doc of lstm_unit outputs in nn.py.
test=develop
6 years ago
tensor-tang
5c06bff222
combine noavx and avx package ( #17889 )
...
* support avx and noavx core
* add catch and give some log
test=develop
* fix build
test=develop
* add missing package
test=develop
* fix pybind name
test=develop
* fix import error
test=develop
* combine noavx core
test=develop
* add requirements
test=develop
* fix unknown message
test=develop
* fix api spec
test=develop
* refine and clean
test=develop
* update
* pass dist ut
* follow comments
test=develop
* refine scripts
test=develop
6 years ago
wawltor
8eb134c3c1
Fix scatter and gather ops when indices contain duplicates ( #17952 )
...
* test=develop
The scatter op had a calculation bug when the indices contained duplicates: it used overwrite mode for repeated indices. Fix this by using accumulate mode for repeated indices. The gather op had the same bug when computing its gradient. The OpenBLAS and Eigen libraries are used to reduce the time cost of accumulate mode.
* test=develop
Fix some code format problems and add test cases for the gather and scatter ops
6 years ago
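
The gather half of #17952 can be illustrated in a few lines. Since `out[k] = x[index[k]]`, the gradient for `x[i]` is the sum of `out_grad[k]` over every `k` with `index[k] == i`, so repeated indices must accumulate. The pure-Python sketch below contrasts the two modes; the function names and the `accumulate` flag are assumptions for illustration, not the operator's interface.

```python
def gather(x, index):
    """Forward pass: pick x[i] for each i in index."""
    return [x[i] for i in index]


def gather_grad(x_len, index, out_grad, accumulate=True):
    """Backward pass of gather for a 1-D input of length x_len."""
    x_grad = [0.0] * x_len
    for i, g in zip(index, out_grad):
        if accumulate:
            # Correct: every use of x[i] contributes to its gradient.
            x_grad[i] += g
        else:
            # Overwrite mode: earlier contributions for a repeated index are lost.
            x_grad[i] = g
    return x_grad
```

For `index = [0, 0, 2]` and `out_grad = [1.0, 2.0, 3.0]`, accumulate mode yields `[3.0, 0.0, 3.0]`, while overwrite mode yields `[2.0, 0.0, 3.0]` and silently drops one contribution.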
lujun
75fcd29220
update load_error_info, test=develop ( #18000 )
...
Improve error message: when loading parameters fails, prompt users to check whether the model or parameter files are damaged.
6 years ago
石晓伟
04ea7cb069
modify the access level of anakin engine ( #18015 )
...
test=develop
6 years ago
wawltor
2ae8decc90
test=develop ( #17984 )
...
Fix bug in sequence_unpad op: the allocated output memory did not match the actual memory, so the memory check failed. Fix this by allocating the output memory at the correct position in the code.
6 years ago
ruri
9d6640ff44
Fix edit distance doc ( #17947 )
...
* fix im2sequence padding bug, test=develop
* fix edit_distance, test=develop
* add API.spec,test=develop
6 years ago
Zeng Jinle
a1bdf25ecb
Add shape not match doc to data layer ( #17936 )
...
* add shape not match doc to data layer, test=develop
* fix API.spec md5
test=develop
6 years ago
cjt222
871af28d6c
add deformable psroi pooling ( #17827 )
...
* add deformable psroi pooling
* test=develop
* test=develop
* test=develop
modify format
* fix bug
* test=develop run ci
* test=develop
add API.spec
* add test_layers.py
* run ci again
* test=develop
run ci again
* run ci again
* test=develop
run ci again
* test=develop
run ci again
* test=develop
run ci again
* add space between two lines
* test=develop
add space between two lines
* test=develop
add space between lines
* test=develop
modify comment in nn.py
* test=develop
add space between two lines
* test=develop
add space between two lines
* update API.spec
* run ci again
* test=develop
run ci again
* rerun ci
* test=develop
rerun ci
* change input shape
* run ci
* test=develop
run ci
* modify format of nn.py
* test=develop
* test=develop
* test=develop
update API.spec
* test=develop
fix API doc
* modify API comment
* modify API comment
* test=develop
update API.spec
* test=develop
modify comment
* test=develop
modify comment
* test=develop
modify comment
* test=develop
update API.spec
* test=develop
modify comment
* test=develop
add inference in nn.py
* test=develop
update API.spec
* test=develop
resolve conflict
* test=develop
update API.spec
6 years ago
SunGaofeng
40885c225b
add unfold op (new op),test=develop ( #17944 )
...
* add unfold op
test=develop
* fix divide bug in python3 when calculating output width and height
test=develop
* add name=None in python api, move redundant code into inline function
* try to trigger ci for this code
test=develop
6 years ago
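
The "divide bug in python3" mentioned in #17944 is most likely the usual one: `/` returns a float in Python 3, whereas an output size must be an integer, so floor division `//` is needed. The sketch below uses the standard sliding-window output-size formula; the function and parameter names are hypothetical, and the log does not confirm that the op uses exactly this expression.

```python
def unfold_output_size(in_size, kernel, stride=1, pad_before=0, pad_after=0, dilation=1):
    """Number of sliding-window positions along one spatial dimension."""
    # A dilated kernel covers dilation * (kernel - 1) + 1 input positions.
    effective_kernel = dilation * (kernel - 1) + 1
    # Floor division keeps the result an int in Python 3; `/` would give a float.
    return (in_size + pad_before + pad_after - effective_kernel) // stride + 1
```

For an input of size 8, kernel 3, and stride 2, the floor-division form gives 3, while the same expression with `/` evaluates to 3.5.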
Jacek Czaja
84bb45c054
[MKL-DNN] Thread-Safety for MKL-DNN reusing Part 1 ( #17965 )
...
* - removed is_reusing_
* - Added TID to keys for reusing apart from softmax PD
* - compilation fix
* - Yet another compilation fix
* - Batch Norm and Conv adapted
* - Fix to softmax MT
* - Fixes to MT code of MKL-DNN
* - Lint fixes
test=develop
6 years ago
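
The idea behind "Added TID to keys for reusing" in #17965 can be sketched as a cache whose keys include the calling thread's id, so that a thread only ever reuses objects it created itself. Everything below (the module-level dictionary, `make_key`, `get_or_create`) is a hypothetical Python illustration of that pattern, not the MKL-DNN integration code.

```python
import threading

_cache = {}
_cache_lock = threading.Lock()


def make_key(base_key):
    """Append the current thread id so each thread gets its own cache entry."""
    return f"{base_key}-t{threading.get_ident()}"


def get_or_create(base_key, factory):
    """Return this thread's cached object for base_key, creating it on first use."""
    key = make_key(base_key)
    # The lock protects the shared dictionary; the cached objects stay per-thread.
    with _cache_lock:
        if key not in _cache:
            _cache[key] = factory()
        return _cache[key]
```

Two calls from the same thread with the same `base_key` return the same object, while a call from another live thread creates and returns a separate one.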
gongweibao
da9143c1cc
Polish codes of old prs. ( #17938 )
6 years ago
石晓伟
bce259e5bf
Update the Anakin interfaces for content-dnn and MLU ( #17890 )
...
* update anakin-engine interfaces for content-dnn
test=develop
* support only-gpu mode of Anakin
modify eltwise parse
test=develop
* modification for thread-safe
test=develop
* Integrated template instance
test=develop
* increase template parameters
test=develop
* support MLU predictor
test=develop
* update anakin cmake files
test=develop
* update TargetWrapper::set_device
* update the initialization of anakin subgraph
test=develop
* use the default constructor of base class
test=develop
6 years ago
hutuxian
969e6378b9
Pipeline Concurrency ( #17402 )
...
Add Pipeline Concurrency Train Mode:
- Cpp: pipeline_trainer & section_worker
- Python: PipelineOptimizer
- Add a new data_feed type: PrivateInstantDataFeed
- Add a test demo of pipeline trainer and the test model is gnn
- Do not support win32 now
6 years ago
Zhaolong Xing
4e8d5a034f
Light mem reuse strategy for inference. ( #17925 )
...
* fix: high RAM usage when loading the model from memory
test=develop
* light mem reuse
test=develop
* fix cpplint
test=develop
6 years ago
Tao Luo
53fd507bae
fix merge conflict of 'Remove attribute in Allocator::Allocate' and elementwise_add_mkldnn_op ( #17949 )
...
test=develop
6 years ago
zhaoyuchen2018
3847d9fc2c
refine sum stack api doc ( #17923 )
...
test=develop
6 years ago
jerrywgz
aab4d12c0e
refine GetExpectedKernelType in concat op, test=develop ( #17934 )
6 years ago
Zeng Jinle
3ece61f71e
Remove attribute in Allocator::Allocate ( #17878 )
...
* remove attribute in Allocator::Allocate, test=develop
* fix travis ci error, test=develop
6 years ago
Yibing Liu
33d1e56506
Enable seq_pool op to accept len 0 input ( #17284 )
...
* Enable seq_pool op to accept len 0 input
test=develop
* Update sequence_pool's api
test=develop
* Add more unittest cases for seq_pool op
test=develop
* Remove legacy comments
test=develop
* Don't use template in op maker
test=develop
6 years ago
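
To make "accept len 0 input" in #17284 concrete, the sketch below sum-pools variable-length sequences stored as one flat list plus offsets, and emits a padding value for any empty sequence. The offsets layout, the function name, and the `pad_value` parameter are assumptions for this illustration; the log does not describe the op's updated API in detail.

```python
def sequence_sum_pool(data, offsets, pad_value=0.0):
    """Sum each sequence data[offsets[k]:offsets[k + 1]]; empty ones yield pad_value."""
    pooled = []
    for start, end in zip(offsets, offsets[1:]):
        if end > start:
            pooled.append(sum(data[start:end]))
        else:
            # Length-0 sequence: nothing to pool, so fall back to the padding value.
            pooled.append(pad_value)
    return pooled
```

With `offsets = [0, 2, 2, 3]` the middle sequence is empty, so the result has three entries and the second one is `pad_value`.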
Yihua Xu
9b5017366a
Fix the format issue when 'X' is not nchw. ( #17833 )
...
test=develop
6 years ago
Hongyu Liu
8062bd510c
Reshape support tensor attribute ( #17781 )
...
* add reshape support tensor; test=develop
* fix reshape bug; test=develop
* change reshape attribute default value; test=develop
* fix reshape input name; test=develop
* fix reshape unit test; test=develop
* check dim tensor shape; test=develop
6 years ago
gongweibao
972c54cd70
Fix FLAGS_fuse_parameter_memory_size unit from Bytes to MBytes. ( #17924 )
6 years ago
Zeng Jinle
0a96ec699c
fix conv v7 workspace size limit error, test=develop ( #17902 )
6 years ago
Jiabin Yang
4d5f6937c3
Feature/refine api for dygraph ( #17907 )
...
* WIP
* WIP
* test=develop, add api doc and example code for dygraph
6 years ago
gongweibao
dd4cd352c7
Fix sync_batch_norm_op ncclallreduce error! ( #17918 )
6 years ago
whs
5df65e506d
Add Light-NAS for PaddleSlim ( #17679 )
...
* Add auto pruning strategy.
1. Fix compressor.
2. Enhance graph executor.
3. Add SAController
4. Add auto pruning strategy.
5. Add unit test for auto pruning strategy.
test=develop
* Init light-nas
* Add light nas.
* Some fix.
test=develop
* Fix sa controller.
test=develop
* Fix unit test of light nas.
test=develop
* Fix setup.py.in and API.spec.
test=develop
* Fix unit test.
1. Fix unit test on windows.
2. Fix package importing in tests directory.
* 1. Remove unused comments.
2. Expose eval_epoch option.
3. Remove unused function in search_agent.
4. Expose max_client_num to yaml file.
5. Move flops constraint to on_epoch_begin function
test=develop
* Fix light nas strategy.
test=develop
* Make controller server stable.
test=develop
* 1. Add try exception to compressor.
2. Remove unit test of light-nas for windows.
test=develop
* Add comments
Enhance controller
test=develop
* Fix comments.
test=develop
6 years ago
Zeng Jinle
3925bd81e8
Fix cuda/cudnn version detection error ( #17853 )
...
* fix cuda/cudnn version detection error, test=develop
* fix again, test=develop
6 years ago
Yihua Xu
14a32bf0c4
Fix the accuracy issue while using float precision to get the scale. ( #17884 )
...
test=develop
6 years ago
gongweibao
fbbdc9ccad
Add backward and optimizer operator dependency pass. ( #17746 )
6 years ago
mozga-intel
c1379bf238
[NGraph] Bert model for a capi, ngraph's support test=develop ( #17844 )
6 years ago
baojun
e2c1b7c354
[NGraph] cache compiled function instead test=develop ( #17845 )
6 years ago
石晓伟
d008260fa8
update the initialization of anakin subgraph ( #17880 )
...
test=develop
6 years ago
Zhaolong Xing
ae576f3c68
fix: high RAM usage when loading the model from memory ( #17788 )
...
test=develop
6 years ago
Zhaolong Xing
5efe8c7287
fix bug: the lod_tensor_to_array op allocates a new var but does not release it during inference ( #17856 )
...
test=develop
6 years ago
Jiabin Yang
022dfed4fc
Add optimizer save and load ( #16986 )
...
* save optimizer related vars in dygraph
* test=develop, add optimizer save and load
* test=develop, add optimizer save and load
* test=develop, merge code and add multi-optimizer save and load
* test=develop, fix test_imperative_checkpoint
* test=develop, fix include error
* test=develop, fix include error
* test=develop, renew api spec
* test=develop, refine code
* test=develop, set default value for checkpoint
* test=develop, fix ci error
* test=develop, change API.spec and make api more readable
* test=develop, refine version and time stamp
* test=develop, add example code and refine code
* test=develop, refine doc
* test=develop, change version
6 years ago
wopeizl
453a49b1bc
Make ParallelExecutor support Windows GPU ( #17787 )
...
* fix the ParallelExecutor on Windows
test=develop
* restrict to use one GPU only under windows
6 years ago
pawelpiotrowicz
39bc8a55a4
[NGraph] Enable ngraph layer_norm operator ( #17599 )
...
* Enable ngraph layer_norm operator
test=develop
* Disable/Enable cuda, new unit-test test=develop
* Fix use_cudnn test=develop
* Fixed test_layer test, new function is added test=develop
* set use_cudnn by default test=develop
6 years ago
翟飞跃
993c703bcc
INT8 MKL-DNN v2 integrate to slim ( #17634 )
...
* refactor PR 16865
* delete mergetool files
* test=develop
* test=develop
* test=develop
* test=develop
* create dir for int8 model before call SaveOptimModel
* test=develop
* mkldnn int8 only support linux; test=develop
* refine code; test=develop
* remove comment; test=develop
* refine code; test=develop
* fix bug; test=develop
* add exception for mkldnn_post_training_strategy
* reuse int8v2 CAPI dataset; test=develop
* fix accuracy check bug; test=develop
* remove tab
* convert files to unix format
* test=develop
* reduce CI time;test=develop
* reduce CI time and refine code;test=develop
* refine comment; test=develop
* add cmake FLAGS;test=develop
* remove predict_num;test=develop
6 years ago
wopeizl
841553e13f
use pyreader to read data in dygraph mode ( #17314 )
...
* use pyreader to read data
* add return_list to PyReader to support return value represented as list
6 years ago
chengduo
5436d66667
close socket connect ( #17862 )
...
test=develop
6 years ago
baojun
a4c528a31c
[NGraph] some ngraph updates to enable bert ( #17739 )
...
* delay infershape test=develop
* fall back subblock to paddle test=develop
* fix edge cases test=develop
* remove output duplicates test=develop
* handle reshape2_grad infershape test=develop
6 years ago
Jiabin Yang
3d3f5506d2
Feature/Fix recurrent usage of Varbase in Dygraph ( #17838 )
...
* for debug
* test=develop, memory optimize for dygraph using shared_ptr
* test=develop, fix travis ci showed error
* test=develop, fix bug for recurrent usage of varbase
* test=develop, init varbase when it need to be Add
* test=develop, fix problem of recurrent gradient
* test=develop, add gradient test for recurrent varbase usage
6 years ago
Zeng Jinle
674e0ce2d6
Use Python C-API to speed up dygraph trace ( #17837 )
...
* use python api to reduce python time cost, test=develop
* fix travis ci, test=develop
* fix Py_None error,test=develop
6 years ago
jerrywgz
5e4f99dd74
refine doc for prelu ( #17810 )
...
* refine doc for prelu
6 years ago
chengduo
d1169afaa3
remove InstallFailureSignalHandler ( #17828 )
...
test=develop
6 years ago
chengduo
437520474c
fix DropLocalExeScopes ( #17829 )
...
test=develop
6 years ago
Leo Zhao
50326563d5
enable mkldnn primitive reuse for platform reorder ( #17826 )
...
test=develop
6 years ago
baojun
7611208ab7
[NGraph] added gather_grad to ngraph test=develop ( #17646 )
6 years ago
tensor-tang
557452e778
update and polish hash op doc ( #17809 )
...
* update and polish hash op doc
test=develop
* update api spec
test=develop
6 years ago
jerrywgz
92d9bdfce2
fix api doc in slice op, test=develop ( #17804 )
6 years ago
Hongyu Liu
dfec676270
expand op support tensor attribute ( #17773 )
...
* expand support tensor attribute; test=develop
* fix bug ; test=develop
* fix unit test bug; test=develop
* fix copy bug; test=develop
* refine expand_times default value; test=develop
6 years ago
Jiabin Yang
3b70f870e2
Use smart pointers to optimize memory usage of dygraph ( #17768 )
...
* for debug
* test=develop, memory optimize for dygraph using shared_ptr
* test=develop, fix travis ci showed error
* test=develop, fix bug for recurrent usage of varbase
* test=develop, init varbase when it need to be Add
6 years ago
Hongyu Liu
82358bfdc1
one hot support tensor depth ( #16972 )
...
* support some input tensor remain on cpu; test=develop
* fix input = none; test=develop
* fix unfound bug; test=develop
* fix proto None case; test=develop
* fix bug; test=develop
* fix proto null bug; test=develop
* remove conv check; test=develop
* fix test bug; test=develop
* move fill constant; test=develop
* no change in proto; test=develop
* fix bug; test=develop
* change attr depth name; test=develop
* remove remain cpu; test=develop
* fix bug; test=develop
* merge develop; test=develop
* fix one_hot bug; test=develop
* fix bug; test=develop
* fix bug; test=develop
* fix bug; test=develop
* fix python api bug; test=develop
6 years ago
Brian Liu
7cfddf22c8
Optimize bilinear interpolate op with OpenMP ( #17800 )
...
Refactor the code to be OpenMP friendly
test=develop
6 years ago
Yibing Liu
d6d33fd748
Add update method for ema ( #17812 )
6 years ago
wangchaochaohu
c10157a5df
revise the cudnn conv choose algorithm to improve the performance(mask rcnn benchmark) ( #17753 )
...
* revise conv layer cudnn algo choose test=develop
* update for code style test=develop
* update for code style test=develop
6 years ago
chengduo
863c75168c
polish error doc ( #17772 )
...
test=develop
6 years ago
Tao Luo
e089e454a1
make omp thread num default 1 after inference run ( #17801 )
...
test=develop
6 years ago
mozga-intel
6a6bf597f7
[NGraph] Enable elementwise_div operator test=develop ( #17515 )
...
* Enable elementwise_div operator test=develop
* Fix update date test=develop
6 years ago
Huihuang Zheng
931698a54a
Modify doc of program_guard, py_reader, data, and clone ( #17727 )
...
Note the append_batch_size variable is doing prepend. We should
change the name, but due to backward compatibility, I suggest to
change at v2.0. Not now.
test=develop
6 years ago
lidanqing
d7c5c2bd64
Add input format in Transpose GetHash ( #17737 )
...
* fix the bug of mobilenet-ssd INT8 inference without overloading GetHash
test=develop
* remove the out_grad->format() in TransposeMKLDNNGradOpKernel
test=develop
6 years ago
tangwei12
659b72a97c
fix document of python api get_startup_program() ( #17764 )
...
* add example to get_startup_program()
* fix example to get_startup_program()
6 years ago
AIFollowers
93de124cec
modify some initializer api ( #17301 )
...
* test=develop modify some initializer api
* test=develop modify API.spec
* test=develop modify API.spec
* test=develop modify API.spec
* test=develop modify API.spec
6 years ago
guru4elephant
d52391094d
fix prepare context redundant code problem, optimize executor by cach… ( #17743 )
...
* fix prepare context redundant code problem, optimize executor by caching create_variables
test=develop
* cache sub_scope, program, var when use_program_cache=True is set
* make fetch_list runnable with variables, add more unit tests for use_program_cache
6 years ago
baojun
2c58f1a83c
[NGraph] Added lookup table to ngraph engine test=develop ( #17647 )
6 years ago
pawelpiotrowicz
bacc822492
[NGraph] Enable transpose ngraph operator ( #17636 )
...
test=develop
6 years ago
lujun
ed9d603a8a
fix api doc: Optimizer.ModelAverage ( #17395 )
6 years ago
baojun
90eae0b39a
[NGraph] Added slice op to ngraph test=develop ( #17648 )
6 years ago
baojun
2fbaa5c075
[NGraph] added matmul op to ngraph engine test=develop ( #17645 )
6 years ago
hong19860320
68dcb1bd7b
fix API examples of assign, reverse and array_write, etc. ( #17287 )
...
* fix API examples of assign, reverse and array_write
test=develop
* update API.spec
test=develop
* update API examples for array_length, array_read, array_write, assign, hard_sigmoid, hsigmoid, increment, ones, pow, reverse, uniform_random and zeros
* update API.spec for assign, reverse and array_write, etc.(#17287 )
* test=develop
6 years ago
chengduo
67c8dade58
Add Event in ScopeBuffer Executor ( #17667 )
...
* add event for fast executor and add threads for scopebuffer executor
test=develop
6 years ago
Bai Yifan
bba57cdd82
Add deformable conv v2 op,test=develop ( #17145 )
...
* unit commits, test=develop
* update API.spec, test=develop
6 years ago
wangchaochaohu
bd48950c7e
fix paddlepaddle API examples ( #17306 )
...
* API.spec test=develop
* update
* update test=develop
* update test=develop
* update
* update test=develop
* update test=develop
* update test=develop
* update test=develop
* update test=develop
* test=develop
* update
* update test=develop
* update test=develop
6 years ago
YishengCheng
bd15912d65
fix bug for ctr_reader for svm data ( #17575 )
...
* fix bug for ctr_reader test=develop
* fix svm data test=develop
fix svm data test=develop
6 years ago
Yiqun Liu
8fd39f3e99
Enhance fused_elementwise_activation op and add python api in contrib.layers ( #17236 )
...
* Enhance fused_elementwise_activation op.
test=develop
* Move the api fused_elementwise_activation to contrib.
test=develop
* Add including files.
test=develop
* Add the support of sigmoid in fused_elementwise_activation op.
* Update API.spec.
test=develop
6 years ago
yaoxuefeng
ac92e4c066
fix distributed_transpiler.py api test=develop ( #17668 )
6 years ago
Yiqun Liu
2704479bb2
Optimize recurrent_op using Prepare and RunPreparedContext, avoiding create operators in every iter. ( #17689 )
...
test=develop
6 years ago
pawelpiotrowicz
9b99876442
Enable less_than ngraph operator ( #17642 )
...
* Enable less_than ngraph operator
test=develop
* Added compare unit-tests test=develop
* Update: date && removed import test=develop
6 years ago
Zhaolong Xing
a9a531fa5f
Refine python api code example note: ( #17369 )
...
* fix:
1. inference multi card occupy
2. facebox model inference occupy too much
test=develop
* refine python api comments: shuffle, while, scale, sampled_softmax_with_cross_entropy, scatter,
round, sin, sqrt, shape, split, soft_relu, slice, selu, ifelse, switch.
test=develop
* fix conflict error.
test=develop
6 years ago
Jiabin Yang
effc555955
test=develop, lazy init Grad ( #17653 )
6 years ago
hutuxian
4ff87c049d
remove useless input 'Softmax@GRAD' from softmax_with_cross_entropy op ( #17612 )
6 years ago
Tao Luo
b4b169467b
add fc_mkldnn_pass in compare_mkldnn ( #17712 )
...
test=develop
6 years ago
pawelpiotrowicz
70a887af63
[NGraph] Add reduce_sum operator for Ngraph ( #17450 )
...
test=develop
6 years ago
baojun
29baca0dd8
add depthwise_conv2d op to ngraph engine ( #17454 )
...
* add depthwise_conv2d test=develop
* use cpu for ngraph test=develop
6 years ago
gongweibao
0d561ef442
fix 2dconn test=develop ( #17681 )
6 years ago
mozga-intel
ccf9e2327b
[Lite] Enable cast operator test=develop ( #17294 )
6 years ago
tangwei12
0d3c48e0a8
fix doc in transpiler, test=develop ( #17313 )
...
* fix doc in transpiler, test=develop
6 years ago
Hongyu Liu
9f85f21880
Add new grad clip [old gradient clip not supported in dygraph] ( #17523 )
...
* add gradient clip in minimize; test=develop
* fix bug; test=develop
* fix format; test=develop
* move new grad clip to dygraph/grad_clip.py; test=develop
* fix lr decay and grad clip test; test=develop
* separate dygraph grad clip; test=develop
* fix grad clip test; develop
* fix api spec bug; test=develop
* add blank line, test=develop,test=document_preview
to fix format problem
6 years ago
Zhaolong Xing
4337009b92
fix trt ci timeout error ( #17701 )
...
test=develop
6 years ago
mozga-intel
5eb81fe595
Capi for a ngraph engine ( #17037 )
6 years ago
Yiqun Liu
5782dddad0
Optimize the concat and split kernel for special cases when the number of inputs/outputs is 2 ( #17415 )
...
* Optimize the concat and split kernel for special cases that the number of inputs/outputs is 2.
test=develop
* Refine codes.
test=develop
* Correct the condition.
test=develop
* Move the define of tmp_data outside the if statement.
* Print the cudnn minor version.
test=develop
* Fix the case when in_num/o_num is 1 in concat/split op.
test=develop
* Remove const_cast.
test=develop
6 years ago
石晓伟
acbb4bf38d
update python API examples ( #17351 )
...
* update python APIs
test=document_preview test=develop
* update API.spec
test=document_preview
test=develop
* update merge_selected_rows
* update API.spec
test=document_preview
test=develop
* update API.spec
test=document_preview
test=develop
* fix the comment of less_than
test=develop
test=document_preview
* update API.spec
test=develop
test=document_preview
* update API.spec
test=develop
test=document_preview
* update API.spec
test=develop
test=document_preview
* update API.spec
test=develop
* update API
test=develop
6 years ago
lidanqing
04b6c29ee0
Improve mobilenetv2 INT8 performance by using INT8 relu as post-op ( #17570 )
...
* add INT8 conv+relu6 fuse and enable mobilenetv2 INT8 test
test=develop
* change false and 0.0 to fuse_brelu and brelu_threshold
test=develop
change the "fuse_relu||fuse_brelu" to "unsigned_output"
test=develop
* Use relu instead of brelu as INT8 post-op because INT8 brelu is not enabled in mkldnn v0.18
test=develop
* continuous-integration fix
test=develop
6 years ago
Jacek Czaja
6d8075ecef
[MKL-DNN] conv_transpose mkldnn bias pass ( #17644 )
...
* - changes to graph detector
- Changes to pass
- Added ut for new pass
- use_pass
- Added pass to mkldnn passes
- fix to registration
- improved verbose messaging for conv bias passes
- Lint fixes
test=develop
* - Lint fixes
test=develop
6 years ago
Shuai Yuan
41f1186c6b
[DOC][PYTHON] Fix api docs, test=develop, test=document_preview ( #17629 )
...
* [DOC] Fix api docs, test=develop, test=document_preview
* [DOC] Fix api annotation: fluid.layers.tensor_array_to_tensor. test=develop, test=document_preview
* test=develop, test=document_preview
update MD5 of tensor_array_to_tensor
6 years ago
wopeizl
058f1f1e1b
fix the api example for create_global_var, create_parameter, SGDOptim… ( #17371 )
...
* fix the api example for create_global_var, create_parameter, SGDOptimizer, RMSPropOptimizer, MomentumOptimizer, LarsMomentumOptimizer, FtrlOptimizer
test=develop
* add example for adamoptimizer
fix API.spec
test=develop
* test=develop
* test=develop
6 years ago
Yibing Liu
4f4f0993c1
Bias correction for exponential moving average ( #17677 )
...
* Bias correction for exponential moving average
test=develop, test=document_preview
* Fix docs
test=develop, test=document_preview
6 years ago
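
The term "bias correction" in #17677 most plausibly refers to the standard trick of dividing a zero-initialized moving average by `1 - decay**t`, which removes its early bias toward zero. The sketch below shows that arithmetic on a plain list of numbers; the function name and the `decay=0.9` default are assumptions for illustration, and the log does not confirm the exact formula used in the implementation.

```python
def ema_with_bias_correction(values, decay=0.9):
    """Return the bias-corrected exponential moving average after each step."""
    ema = 0.0  # zero initialization is what introduces the bias
    corrected = []
    for step, value in enumerate(values, start=1):
        ema = decay * ema + (1.0 - decay) * value
        # After `step` updates the weights sum to 1 - decay**step, so rescale by it.
        corrected.append(ema / (1.0 - decay ** step))
    return corrected
```

For a constant input the corrected average equals that constant from the very first step, whereas the uncorrected `ema` would start near zero and only approach it gradually.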
Tao Luo
962eed6f82
Revert "Enable SQRT operator for the nGraph Bridge ( #17549 )" ( #17680 )
...
This reverts commit f34830e2aa.
6 years ago
Tao Luo
67a6297a9f
update unique_name notes and examples ( #17671 )
...
test=develop
6 years ago
Krzysztof Binias
f34830e2aa
Enable SQRT operator for the nGraph Bridge ( #17549 )
...
* Enable sqrt operator for the nGraph Bridge.
test=develop
* Update activation_op.h
6 years ago
Sylwester Fraczek
96845d2168
add Concat quantization ( #17448 )
...
* add Concat quantization
add unit test for quantizing concat
fix for wrong value when the input is not in map of calculated scales
add use_quantizer to concat_op.cc
add scale_algo rules for concat
test=develop
* missing fix for multiple inputs quantize-squash
* wojtuss review fix: adding comment
test=develop
6 years ago
Zeng Jinle
432ac70124
clean code of py_layer in dygraph mode,test=develop ( #17661 )
6 years ago
gongweibao
65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. ( #17263 )
6 years ago
Krzysztof Binias
b1bd483a7d
[NGraph] Enable gelu operator for the nGraph Bridge. ( #17547 )
...
test=develop
6 years ago
Zhen Wang
8bd651b7ed
Fix the bug in the AnalysisPredictor and add more directions about io APIs. ( #17639 )
...
* fix the bug that sub_scope_ may be null in AnalysisPredictor::Run.
* add more directions about io APIs' docs.
* update the API.spec. test=develop test=document_preview
6 years ago
chengduo
343017324e
Polish Print Op ( #17651 )
...
* enhance print
6 years ago
Zeng Jinle
4aa931dd85
Code clean of Allocator ( #17602 )
...
* Revert "Revert "Fix allocator bug""
This reverts commit 174d0d0b90.
* Revert "fix travis ci"
This reverts commit 5656fa9f7c.
test=develop
* add inlined_vector.h, test=develop
* add inlined_vector_test,test=develop
* clean code of allocator,test=develop
* delete zero_size_allocator.h,test=develop
* fix failed unittest,test=develop
6 years ago
Guo Sheng
430e25654b
Fix the usage of out_grad lod in sequence_slice_op. ( #17625 )
...
test=develop
6 years ago
Bai Yifan
bbd6e438fc
fix conflicts,test=develop ( #17186 )
6 years ago
bdzhuxiaoning
9f85afb7b6
test=develop ( #17643 )
6 years ago
chengduo
9322216170
Add data distributed_sampler ( #17573 )
...
* add data parallel batch
6 years ago
hutuxian
1670db5e86
Gather Op Index Support int64_t datatype ( #17610 )
...
* gather_op support int64_t index by adding a template typename
* add UT and rename typename
test=develop
6 years ago
mozga-intel
2b83d75bfa
Enable elementwise pow operator for ngraph ( #17526 )
6 years ago
Zhaolong Xing
61221ebc28
TRT: Support set dynamic range in int8 mode. ( #17524 )
...
* fluid int8 train and trt int8 predict align.
trt int8 predict init
op converter
* 2. align fluid int8 train and trt int8 inference.
enhance quant dequant fuse pass
enhance op converter, trt engine, trt engine op, trt subgraph pass.
* 3. add delete_quant_dequant_pass for trt
test=develop
* 4. add the missing file
test=develop
* 5. I modified the C++ interface but forgot to modify the pybind code
fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
test=develop
6 years ago
Michał Gallus
0c39b97b4e
[MKL-DNN] Add Fully Connected Op for inference only( #15226 )
...
* fuse mul and elementwise add to fc
* Reimplement the FC forward operator
* Fix FC MKLDNN integration by transposing weights
* Add FC MKLDNN Pass
test=develop
* FC MKLDNN Pass: change memcpy to std::copy
* Fix MKLDNN FC handling of mismatch input and weights dims
* Lower tolerance for MKL-DNN in resnet50 test
test=develop
* Adjust FC to support MKLDNN Op placement
test=develop
* Adjust Placement Op to set use_mkldnn attribute for graph
test=develop
* MKLDNN FC: fix weights format so that gemm version is called
test=develop
* FC MKLDNN: Remove tolerance decrease from tester_helper
* FC MKL-DNN: Refactor the code, change input reorder to weight reorder
* MKL-DNN FC: Introduce operator caching
test=develop
* FC MKL-DNN: Fix the tensor type in ExpectedKernelType
test=develop
* FC MKL-DNN: fix style changes
test=develop
* FC MKL-DNN: fallback to native on non-supported dim sizes
test=develop
* FC MKLDNN: fix CMake paths
test=develop
* FC MKLDNN: Refine placement pass graph mkldnn attribute
test=develop
* Fix Transpiler error for fuse_conv_eltwise
test=develop
* Fix missing STL includes in files
test=develop
* FC MKL-DNN: Enable new output size computation
Also, refine pass to comply with newest interface.
test=develop
* FC MKL-DNN: enable only when fc_mkldnn_pass is enabled
* FC MKL-DNN: Allow Weights to use oi or io format
* FC MKL-DNN: Adjust UT to work with correct dims
test=develop
* Enable MKL DEBUG for resnet50 analyzer
test=develop
* FC MKL-DNN: Improve Hashing function
test=develop
* FC MKL-DNN: Fix shape for fc weights in transpiler
* FC MKL-DNN: Update input pointer in re-used fc primitive
* Add log for not handling fc fuse for unsupported dims
test=develop
* FC MKL-DNN: Move transpose from pass to Op Kernel
test=develop
* FC MKL-DNN: Disable transpose in unit test
test=develop
* FC MKL-DNN: Remove fc_mkldnn_pass from default list
* Correct Flag for fake data analyzer tests
test=develop
* FC MKL-DNN: Add comment about fc mkldnn pass disablement
test=develop
* FC MKL-DNN: Disable fc in int8 tests
test=develop
6 years ago
wopeizl
6724a652f3
add __str__ method for tensor and lodtensor to support print test=dev… ( #17588 )
...
* add __str__ method for tensor and lodtensor to support print test=develop
6 years ago
Krzysztof Binias
e9216d0602
Enable logical operators for the nGraph Bridge. ( #17543 )
...
test=develop
6 years ago
Hongyu Liu
cbaf9e5344
Fix api example [ lstm, sequence_enumerate, sequence_expand,sequence_expand_as ] ( #17210 )
...
* fix example; test=develop
* fix api spec; test=develop
* fix api spec; test=develop
* add doc check
test=develop
test=document_preview
* test=develop,test=document_preview
add blank line to fix format, add one more "import"
* fix bug; test=develop
* fix bug; test=develop
6 years ago
guru4elephant
326bf8291a
add Run Prepared Ctx ( #17616 )
...
add Run Prepared Ctx, fix pybind problem
6 years ago
Yibing Liu
e8990e64f6
Fix trust ratio in lamb ( #17614 )
...
test=develop
6 years ago
Guo Sheng
2a7b321110
Fix the example code in some Python API. ( #17343 )
...
* Fix the example code in some Python API.
test=develop
* Fix the example code in some Python API by adding import.
test=develop
6 years ago
chengduo
b5f4d5ed0e
Add broadcast operators ( #17503 )
...
* This PR adds broadcast for multi-process. And it could be used in dynamic graph to broadcast parameters.
6 years ago
flame
2280f185d7
BuildStrategy api comment ( #17348 )
...
Python examples of fluid.layers.io.double_buffer and some BuildStrategy's methods.
6 years ago
Sylwester Fraczek
5b2a3c4b12
Conv concat relu quantization ( #17466 )
...
* add conv_concat_relu fuse
test=develop
* add test code
test=develop
* added missing include with unordered_map
test=develop
* review fixes for wojtuss
test=develop
* remove 'should (not) be fused' comment statements
one of them was invalid anyway
test=develop
6 years ago
Sylwester Fraczek
bccb0ba49a
fix quantize_squash_pass segfault when no tensor linked to Bias ( #17292 )
...
* fix quantize_squash_pass segfault when there is no tensor linked to Bias input
test=develop
* add googlenet test
test=develop
* fix concat CreateKey not using input format
test=develop
6 years ago
chengduo
2dc1c6f25c
Add profiler in tracer ( #17076 )
...
* add profiler in tracer.cc
* add profiler in layer.cc
test=develop
* add profiler in Layer.cc
test=develop
6 years ago
mozga-intel
0d4cbdad91
[NGraph] Enable elementwise mul operator ( #17552 )
6 years ago
tianshuo78520a
cee9dcc383
Delete LoDTensorset in API.spec ( #17577 )
...
* test=develop
* test=develop
* test=develop
* del #
6 years ago
mozga-intel
f2694e122d
[NGraph] Enable assign operator for a ngraph, test=develop ( #17437 )
...
* Enable assign operator for a ngraph, test=develop
* Cross_entropy operators need to be updated
6 years ago
mozga-intel
cf02cb5e98
Enable elementwise sub operator for ngraph ( #17527 )
6 years ago
guru4elephant
7f8bc49d00
polish_executor_and_add_ctx_cache ( #17536 )
...
* polish_executor_and_add_ctx_cache
6 years ago
tensor-tang
7ae461eb13
[CPU] refine cpu softmax bwd ( #17534 )
...
* refine softmax fwd
test=develop
* refine cpu softmax bwd
test=develop
* fix batch size
test=develop
* fix compile issue with gpu
test=develop
* add value clip
6 years ago
Yibing Liu
6e11f97708
Add exponential moving average ( #17562 )
...
* Add exponential moving average
test=develop, test=document_preview
* Polish documents
test=develop, test=document_preview
* Update API spec
test=develop, test=document_preview
6 years ago
tensor-tang
0600b370ea
[CPU] refine softmax op fwd on CPU ( #17522 )
...
* refine softmax fwd
test=develop
* fix compile issue with gpu
test=develop
* add value clip to avoid exp
6 years ago
Zeng Jinle
c6189637cd
Fix allocator bug ( #16712 )
...
* Revert "Revert "Fix allocator bug""
This reverts commit 174d0d0b90.
* Revert "fix travis ci"
This reverts commit 5656fa9f7c.
test=develop
* add inlined_vector.h, test=develop
* add inlined_vector_test,test=develop
6 years ago
mozga-intel
035771512d
Enable elementwise min operator for ngraph ( #17521 )
6 years ago
Kaipeng Deng
cf60e5a2db
fix API python example ( #17226 )
...
* fix api example. test=develop
* fix API.spec. test=develop
* fix spectral_norm format. test=develpp
* merge develop
* add import. test=develop
* fix indent. test=develop
* fix indent. test=develop
* add import fluid. test=develop
6 years ago
Qiao Longfei
92e7d5d7cc
fix distribute doc test=develop ( #17318 )
...
* fix distribute doc
6 years ago
jerrywgz
c1aae8b8d2
Fix GetExpectedKernelType in Concat op ( #17459 )
...
* fix concat op vartype check, test=develop
6 years ago
Qiao Longfei
58f7695ab2
Async exe support communicator ( #17386 )
...
Async exe support communicator
6 years ago
Zhaolong Xing
38da103034
fix trt ci bug temporary. ( #17565 )
...
ban all trt ut. will fix it later.
test=develop
6 years ago
mozga-intel
109b5aed5a
[NGraph] Enable reshape operator test=develop ( #17512 )
6 years ago
zhang wenhui
9bb6a421e3
fix bpr_loss data_norm teacher_student_sigmoid_loss api & fix continuous_value_model ( #17331 )
...
* fix bpr data_norm teacher_student_sigmoid , test=develop test=document_preview
Fixed the three APIs bpr_loss, data_norm, and teacher_student_sigmoid_loss, and also fixed English spelling errors in the continuous_value_model documentation.
6 years ago
lijianshe02
300bd7504d
fix api-doc related bugs test=develop test=document_preview ( #17360 )
...
* fix api doc according to the reviewer's comment test=develop
6 years ago
lijianshe02
daf88968e2
fix bug that saved optimal model path in test_analyzer_save_model con… ( #17555 )
...
* modify saved model path in analyzer_save_model.cc test=develop
6 years ago
Krzysztof Binias
43d15b9d96
Enable square operator for the nGraph Bridge. ( #17551 )
...
test=develop
6 years ago
Sevin F. Varoglu
f86f49e779
[NGraph] add increment op to ngraph engine ( #16929 )
...
* add increment op to ngraph engine
test=develop
* fix style errors
test=develop
6 years ago
baojun
8923612b10
NGraph enable parse serialized graph test=develop ( #17453 )
6 years ago
Yiqun Liu
cf5d271c5a
Fix examples of fluid.layers.sums and fluid.layers.DynamicRNN ( #17308 )
...
* Fix examples of fluid.layers.sums.
test=document_preview
* Correct the example of DynamicRNN and its functions.
test=develop
* Add 'import paddle.fluid as fluid' to examples.
test=develop
* Update API.spec.
test=develop
* Add space lines.
test=develop
* Update the API.spec.
test=develop
6 years ago
guomingz
2281ebf0f3
Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. ( #17130 )
...
* Relu6 is the bottleneck op for Mobilenet-v2. As mkldnn supports the conv/relu6 fusion, we implement this fusion via the cpass approach. Because int8 support for this fusion will arrive in MKLDNN v0.20, this PR focuses on the fp32 optimization.
The table below shows the benchmark (FPS) measured on skx-8180 (28 cores):
Batch size | with fusion | without fusion
-- | -- | --
1 | 214.7 | 53.4
50 | 1219.727 | 137.280
test=develop
* Fix the format issue
test=develop
* Add the missing nolint comments.
test=develop
* Fix the typos.
test=develop
* Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine.
test=develop
* Adjust the indentation.
test=develop
* Add the test_conv_brelu_mkldnn_fuse_pass case.
test=develop
* Slightly update the code per Baidu comments.
Embed the parameter definition in the code.
That will make the code easier to understand.
test=develop
6 years ago
Yibing Liu
f9796b1249
Add LAMB Optimizer support ( #17489 )
...
* Add LAMB optimizer
* Expose LAMB Optimizer's APIs
test=develop, test=document_preview
* Cleanup code & doc
test=develop, test=document_preview
* Update lamb optimizer's formula
test=develop
6 years ago
mozga-intel
99ab57123c
Enabled ngraph elementwise max operator ( #17517 )
6 years ago
Tao Luo
3d19f44a89
remove unused SERIAL compiler option ( #17500 )
...
test=develop
6 years ago
zhaoyuchen2018
dfdcd91869
Add api doc code examples ( #17285 )
...
* Add api doc code examples
add or fix topk, squeeze, stack, StaticRNN,
StaticRNN memory in doc
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* Add squeeze md5.
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* Add import package
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
6 years ago
mozga-intel
1eb151752e
Enable abs operator for a ngraph test=develop ( #17436 )
6 years ago
lidanqing
36757ed203
Enabling resnet101, vgg16, vgg19 INT8v2 model tests ( #17468 )
...
* Add 6 models tests support in CMake
* enabling resnet101, vgg16, vgg19 INT8v2 model tests
test=develop
* remove SERIAL
test=develop
6 years ago
liuwei1031
ba70cc499e
fix security bugs : ( #17464 )
...
http://newicafe.baidu.com:80/issue/PaddleSec-33/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-28/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-25/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-24/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-21/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-20/show?from=page
test=develop
6 years ago
Zhaolong Xing
ff7f911b4d
add quant_dequant_moving_avg_max_abs op ( #17480 )
...
* add quant_dequant_moving_avg_max_abs op
test=develop
* add more note for quantdequant op
test=develop
6 years ago
Qiao Longfei
287de41c04
Optimize communicator flags ( #17494 )
...
* optimize communicator flag
* change flags in init py test=develop
6 years ago
liuwei1031
c3949f5699
remove two useless flags: enable_subgraph_optimize, memory_optimize_debug, test=develop ( #17491 )
6 years ago
liuwei1031
f82e4d75e7
improve the doc of paddle.fluid.memory_optimize, test=develop ( #17473 )
...
* improve the doc of paddle.fluid.memory_optimize, test=develop
* fix typo, test=develop
6 years ago
Tao Luo
32da5e9c3d
remove unused expected_kernel_cache_pass ( #17486 )
...
test=develop
6 years ago
wopeizl
ca3ba378c7
fix the random compilation failure on windows test=develop ( #17475 )
...
* fix the random compilation failure on windows
6 years ago
lvmengsi
10b23a72c1
Double backward elementwise div ( #17416 )
...
* double backward, elementwise_div
* fix dx empty. test=develop
* bug fix (#17392 )
fix secure bug
* Enable stack operator for a Ngraph, test=develop (#17406 )
* fix sqrt_grad_grad unittest. test=develop (#17410 )
* fix sqrt_grad_grad unittest. test=develop
* disable sqrt_grad_grad unittest. test=develop
* test=develop, fix unittest
* test=develop, fix unittest
* test=develop, fix unittest
* test=develop, fix bug
* fix unittest. test=develop
* fix unittest dx. test=develop
* tmp fix! for test... test=develop
* reduce tmp, test=develop
* test=develop, reduce tmp
* fix broadcast unittest. test=develop
* fix format. test=develop
* refine code. test=develop
* refine code. test=develop
* refine GetDoubleGradSafeTensor. test=develop
* fix format. test=develop
6 years ago
qingqing01
97f0ec2357
Fix compiling error with cuDNN 5.1 ( #17458 )
...
test=develop
6 years ago
Zeng Jinle
3d4e8268c6
fix recurrent fwd bug when no backward and scope clear ( #17460 )
6 years ago
lvmengsi
977e9fcb27
support elementwise_sub double backward ( #17476 )
...
add elementwise_sub_grad_grad op for backward of backward calculation
6 years ago
jiaqi
75cda4d9df
fix data_feed_desc.py example run error ( #17452 )
...
* fix data_feed_desc.py example run error
test=develop
test=test=document_preview
* fix data_feed_desc.py example display error
test=develop
test=document_preview
* update API.spec for DataFeedDesc
test=develop
test=document_preview
6 years ago
chengduo
5a6ab38013
Add record event And remove CSP ( #17447 )
...
* add record_event
test=develop
* remove csp
test=develop
6 years ago
Yan Xu
0217555530
polish parallel dygraph code ( #17164 )
...
* add var grad hook test=develop
6 years ago
Jiabin Yang
d7df4e5e5b
Fix/Fix memory leak in dygraph ( #17394 )
...
* test=develop, add gradient sort backward strategy
* test=develop, fix test by adding FLAGS_cudnn_deterministic on new tests
* test=develop, fix memory leak in dygraph mode
* test=develop, fix memory leak in dygraph mode
* test=develop, polish code
* test=develop, polish code
* test=develop, polish code
6 years ago
Qiao Longfei
728bbaa4e3
add cache_update_mutex_ for operator test=develop ( #17124 )
...
* add cache_update_mutex_ for operator
6 years ago
Bai Yifan
3a9ae28d32
fix assert,test=develop ( #17445 )
6 years ago
zhaoyuchen2018
b02f2aff04
Add conditional compile for gru opt ( #17368 )
...
* improve gru unit performance.
refine code
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* Add conditional compile for gru opt
Do not enable gru opt if compute capability < 700
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* refine code.
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
6 years ago
liuwei1031
6a53fa95e7
improve the API Sample of DataFeeder, memory_optimize and release_memory ( #17374 )
...
* improve the API Sample of DataFeeder, memory_optimize and release_memory, test=develop
* update API.spec, test=develop, test=document_preview
* tweak the code format of feed API, test=develop
* update API.spec, test=develop
* improve doc for DataFeeder and default_main_program, test=develop
6 years ago
guru4elephant
43c9561e9a
add inductive shape index ( #17435 )
...
add inductive shape index
6 years ago
Zeng Jinle
712bfb17cb
fix recurrent_op,test=develop ( #17433 )
6 years ago
Tao Luo
5babcd02dd
Revert "remove unnecessary prepare_data ( #17080 )" ( #17432 )
...
This reverts commit aca60e9a20.
6 years ago
chengduo
e336dc86bb
[Speed] Refine the Executor when the num_thread=1 ( #17405 )
...
Refine the Executor when the num_thread=1
6 years ago
Jie Fang
30e178fa2c
init auto loss scaling ( #17194 )
...
* init auto loss scaling
test=develop
* change API.spec
* change ifelse to switch and use reduce_sum to optimize checking isfinite
test=develop
* Remove redundant code
test=develop
6 years ago
Zhen Wang
4a1b7fec96
Add setting Scope function for the graph class ( #17417 )
...
* add set_not_owned function for graph
* add scope set. test=develop
* add scope_ptr enforce not null before setting.test=develop
6 years ago
mozga-intel
6ee6700fac
Enable stack operator for a Ngraph, test=develop ( #17406 )
6 years ago
flame
e48dd92fc8
bug fix ( #17392 )
...
fix secure bug
6 years ago
jiaqi
66d51206b1
add save/load model, shrink table, cvm, config file & fix pull dense bug ( #17118 )
...
* add save/load model, shrink table, cvm, config file & fix pull dense bug
test=develop
* fix global shuffle bug, fix pull dense bug, fix release memory bug, fix shrink error
add client flush, add get data size
test=develop
* fix global shuffle bug
test=develop
* fix global shuffle bug
test=develop
* fix code style
test=develop
* fix code style & modify pslib cmake
test=develop
* fix error of _role_maker
test=develop
* fix code style
test=develop
* fix code style
test=develop
* fix code style
test=develop
* fix code style
test=develop
* fix code style
test=develop
* fix windows compile error of fleet
test=develop
* fix global shuffle bug
* add comment
test=develop
* update pslib.cmake
test=develop
* fix fill sparse bug
test=develop
* fix push sparse bug
test=develop
6 years ago
Krzysztof Binias
0823a7bc8b
Optimize the sequence padding op ( #17403 )
...
test=develop
6 years ago
baojun
1ce7b45b9e
NGraph Added fill_zeros_like op test=develop ( #17295 )
6 years ago
baojun
910196524d
NGraph Added dropout and dropout_grad to ngraph test=develop ( #17320 )
6 years ago
mozga-intel
b189480734
Ngraph Enable gather operator test=develop ( #17296 )
6 years ago
lvmengsi
4ef631013c
Double backward sqrt ( #17387 )
...
* double backward sqrt
* refine unittest. test=develop
* refine test. test=develop
* remove alpha in unittest. test=develop
6 years ago
JesseyXujin
829fcc98fb
Fix some APIs' example
...
* test=develop
* test=develop
* test=develop
6 years ago
Zeng Jinle
eab34b2df6
fix_dygraph_mem_leak, test=develop ( #17396 )
6 years ago
lvmengsi
5d1ac41b00
Double backward reduce mean ( #17372 )
...
* test=develop, double backward reduce_mean
* add comment. test=develop
* fix format. test=develop
* rename GradGrad -> DoubleGrad. test=develop
* fix op_use_default_grad_op_maker.spec. test=develop
6 years ago
jerrywgz
0cae5a36b6
enhance generate mask labels, test=develop ( #17380 )
6 years ago
Kaipeng Deng
bd9bef5a4e
add elementwise_add_grad_grad op ( #17366 )
...
* add elementwise_add_grad_grad op. test=develop
* use defined GradMaker. test=develop
6 years ago
jerrywgz
1c6d064627
add collect fpn proposals op,test=develop ( #16074 )
...
* add collect fpn proposals op,test=develop
6 years ago
Kaipeng Deng
60be66e2c0
support fc_op double grad ( #17317 )
...
* add double grad for mul_op. test=develop
* fix format. test=develop
* fix format. test=develop
* fix format. test=develop
* refine code. test=develop
* remove setzero. test=develop
* fix dx/dy init bug. test=develop
* fix format. test=develop
6 years ago
Zhen Wang
ad8bbe587e
Fix some api example codes' bugs and these APIs include load_inference_model, load_vars, save_vars, L1DecayRegularizer and L2DecayRegularizer. ( #17324 )
...
* fix some api example codes' bugs.
* update API.spec. test=develop test=document_preview
* add import fluid. test=develop test=document_preview
6 years ago
Tao Luo
68ec0a6f74
make parallel_executor support FLAGS_use_mkldnn ( #17341 )
...
* make parallel_executor support FLAGS_use_mkldnn
test=develop
* add warning when set mkldnn_enabled_op_types_ in non-mkldnn env
test=develop
6 years ago
liuwei1031
0863599323
Fix the uninitialized gru_value.output_value. ( #17197 )
...
test=develop
6 years ago
zhoukunsheng
2ff7ea3337
Expose sign op ( #17117 )
...
* test=develop
add sign op
* Update nn.py
test=develop
delete stop_gradient assignment
6 years ago
Yihua Xu
218d8d8f73
Optimize the computing kernel of sequence_reverse operator ( #17349 )
...
* Optimize the computing kernel of sequence_reverse operator.
test=develop
* Clean code
test=develop
* Fix for cpplint syntax checking.
test=develop
* Fix the compile warning issue.
test=develop
6 years ago
Yiqun Liu
dcda20233c
Optimize the elementwise op using eigen ( #15494 )
...
* Optimize the elementwise op with CUDA kernels.
test=develop
* Support setting of attr in op config file.
test=develop
* Add support for setting dtype and initializer in config.
test=develop
* Save workspace.
* Add initializer "zeros".
test=develop
* Fix compiling error.
* Support using an existing file to initialize a tensor in op_tester.
* Use eigen to optimize the elementwise_add/mul for the case that x and y have the same dims.
test=develop
6 years ago
Jiabin Yang
4624d7c642
test=develop, add gradient sort backward strategy ( #17125 )
...
* test=develop, add gradient sort backward strategy
* test=develop, fix test by adding FLAGS_cudnn_deterministic on new tests
6 years ago
qingqing01
1d0ba5e815
Fix the example code in some Python API ( #17333 )
...
* Fix the example code in some Python API
* Update paddle/fluid/API.spec
* Fix some examples format
6 years ago
Kaipeng Deng
8bae8590ac
add double grad for elementwise_mul op ( #17255 )
...
* add double grad for elementwise_mul. test=develop
* remove comment. test=develop
* fix grad sum. test=develop
* fix for axis expand. test=develop
* add test for axis expand. test=develop
6 years ago
Kaipeng Deng
11d3a38f25
add double grad for square op ( #17173 )
...
* add double grad for square. test=develop
* format code. test=develop
* fix for grad sum. test=develop
* refine shape. test=develop
* refine extract. test=develop
6 years ago
Jiabin Yang
31536016ea
test=develop, test=document_preview, fix 13 api doc and code ( #17293 )
...
* test=develop, test=document_preview, fix all 13 api doc and code
* test=develop, fix rst
* test=develop, refresh API.spec
6 years ago
chengduo
bc833945a4
Add DropLocalExeScopes in ParallelExecutor ( #17297 )
...
* reset drop local scope counter
test=develop
6 years ago
zhoukunsheng
d4b67e1692
Add Where Op( #16793 )
6 years ago
zhoukunsheng
1bfff02047
Add Diag Op( #17027 )
6 years ago
zhaoyuchen2018
8a2caacdbc
improve gru unit performance. ( #16338 )
...
refine code
fuse cublas calling and kernels into one cuda kernel.
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
6 years ago
SunGaofeng
ddb24d48c5
test=develop ( #17322 )
6 years ago
qingqing01
e32c9888f5
Double backward of conv2d. ( #17211 )
...
* Add conv2d_grad_grad_op
* Extract the cuDNN conv algo searching code into conv_cudnn_helper.h.
- Now use it in conv2d_grad_grad.
- Will simplify the searching code in conv2d and conv2d_grad in the next PR.
* Enhance and fix bug in unit testing of gradient_checker.
* Support fetching empty variables; return None in Python.
6 years ago
Zeng Jinle
5e5e7b3305
fix data_type error message ( #17312 )
...
test=develop
6 years ago
Zeng Jinle
fff270eacd
follow comments,test=develop ( #17273 )
6 years ago
Zhaolong Xing
7a3bb061d8
fix: ( #17279 )
...
1. inference multi card occupy
2. facebox model inference occupy too much
test=develop
6 years ago
xiaoting
50ad9046c9
add import, test=develop ( #17229 )
6 years ago
zhoukunsheng
4292bd8687
Mod floordiv ( #17251 )
...
* test=develop
add elementwise_mod and elementwise_floordiv, fix equation problem in elementwise_mod
6 years ago
guru4elephant
5d6a1fcf16
fix infer_from_dataset and train_from_dataset ( #17243 )
...
* fix train_from_dataset and infer_from_dataset example
* add inductive dim for data_reader, example: shape=[-1, 1], then -1 will be inducted through run-time reading of number of elements
6 years ago
chengduo
516317cf91
use sync copy ( #17291 )
...
test=develop
6 years ago
Huihuang Zheng
2c4462711f
Fix API example code of save_inference_model ( #17274 )
...
* Fix API example code of save_inference_model
test=develop
* Add "import" in example of save_inference_model
* Fix typo "exsample" -> "example"
test=develop
6 years ago
xiaoting
9ed4aaada4
modified formula for Lrn ( #17281 )
...
* modified formula for lrn
test=develop
* modified api.spec
test=develop
6 years ago
zhaoyuchen2018
792443ef23
Refine elementwise kernel. ( #16952 )
...
* Refine elementwise kernel.
Add a simple cuda kernel if grad x and y both exist
Use 2D block cuda kernel to do broadcast.
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* refine code.
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* refine code.
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
6 years ago