Commit Graph

9054 Commits (d9db94d75280680630f683b3efd45d5c226a9792)

Author SHA1 Message Date
Jie Fang d9db94d752 Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus. (#19714)
6 years ago
wangchaochaohu 47af618f70
Strided slice (#19642)
6 years ago
Zeng Jinle 13ca364ceb
remove some flags and add comments to some flags, test=develop (#19813)
6 years ago
Huihuang Zheng 3e1e1fee26
Return correct currrent block of a var (#19850)
6 years ago
123malin 1bc285a53a
add retry function to try to solve grpc error code 14 (#19661)
6 years ago
Bai Yifan e2c6bada36 Support dispensable student_loss in PaddleSlim distillation (#19824)
6 years ago
LielinJiang 6d72a86b14 fix_roi_transform_bug (#19785)
6 years ago
Zeng Jinle db26de8389
[Bug fix] Disable memory reuse on feeded variables (#19835)
6 years ago
chengjuntao 00efd1d8a9
add deformable conv v1 op and cpu version of deformable conv v2 (#18500)
6 years ago
chengduo b99fc38cec
Add fp16 support for dygraph (#19828)
6 years ago
Leo Chen 5fbf03d6b4 Enhance OpTest to support double grad inplace check (#19826)
6 years ago
liym27 677e714425 fix pow op, support tensor for agument factor. (#19313)
6 years ago
liym27 bd89a27308 add tensor support for argument shape in reshape op; (#19268)
6 years ago
liym27 88628016b2 add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208)
6 years ago
liym27 e9e3c08777 fix expand op: (#19302)
6 years ago
xujiaqi01 6bf298bf09
support preload thread, optimize hdfs log, fix master+patch bug (#19695)
6 years ago
Jiabin Yang cc311bdf95
Feature/add transform data dygraph (#19707)
6 years ago
lvmengsi b76343c3b7
cpu Conv double grad (#19672)
6 years ago
翟飞跃 93c85c930a Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770)
6 years ago
chengduo 2729c17446
Fix example error of Variable and Operator (#19821)
6 years ago
ruri a0e9b7b9a2
add unittest for square error cost op (#19746)
6 years ago
zhongpu 52673956de add kernel for squeeze_op, test=develop (#19656)
6 years ago
Chen Weihang 00d5375e0c
Add prune_backward function to cover complicated test_program.clone situation (#19772)
6 years ago
tangwei12 6a1db2044c
fix sync_with_distributed_lookup_table, test=develop (#19737)
6 years ago
Aurelius84 8c7e411908
Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716)
6 years ago
JesseyXujin e352467c1c
modify activation op API, delete use_cudnn args, test=develop, (#19758)
6 years ago
Huihuang Zheng 12542320c5
Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
6 years ago
chengduo c308c88d71
Fix test_parallel_executor_test_while_train (#19723)
6 years ago
Zeng Jinle 078a678219
refine math_op_patch, test=develop (#19727)
6 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
Aurelius84 22301115d0
Remove constraint that last dimension is forced to be 1 in huber_loss op (#19562)
6 years ago
Youwei Song 3e5fb6361b fix api-doc error for dygraph and backward (#19721)
6 years ago
Thunderbrook bda92434db remove trainer desc test in windows temporarily (#19753)
6 years ago
Youwei Song 810a33c8bd fix dygraph partitial backward problem (#19625)
6 years ago
Tao Luo ec9bc1bd9f
paddle::framework::vectorize() templatization (#19730)
6 years ago
Bai Yifan aa63d5ac6d Make the distillation process not save teacher variables in PaddleSlim (#19633)
6 years ago
Zeng Jinle bb4f8dee83
add logs to left var memory size, test=develop (#19722)
6 years ago
chengduo 2c30e64b2f
increase timelimit test_pe_serexnext (#19702)
6 years ago
wangguanzhong 25dcd74d34
merge empty lod tensor, test=develop (#19228)
6 years ago
gongweibao 6c2bc29cc0
Fix float16 optimizer. (#19682)
6 years ago
Chen Weihang c78a4781bf
Fix train error when test_program.clone is executed after optimizer.minimize (#19397)
6 years ago
zhongpu 5f627488db add kernel for unsqueeze_op and Add unsqueezed op test, test=develop (#19436)
6 years ago
Zeng Jinle a7691603a5
add gpu_allocator_try_time config, test=develop (#19675)
6 years ago
JesseyXujin 0b06db9413
delete transmission args in linear_chain_crf op (#19619)
6 years ago
123malin a25a716e87
Optimize fleet API: add input check for some interfaces (#18971)
6 years ago
Aurelius84 25c0eb28cc
fix unittest random failed test=develop (#19654)
6 years ago
chengduo 5c4eb394e9 Make test_pe_seresnext serial (#19634)
6 years ago
chengduo ac34aac33f Disable the test_parallel_executor_test_while_train temporarily (#19641)
6 years ago
Chen Weihang 73daa3d6c0
Code Cleanup: delete three useless raw variables in Conv2D (#19644)
6 years ago
Jie Fang c6a598a276 init new amp, optimize inserting cast op for batchnorm (#18596)
6 years ago