Commit Graph

749 Commits (43a82d83fa60be0524a752a10e522429783cbf67)

Author SHA1 Message Date
Jacek Czaja cef95ee30d [MKL-DNN] Refactoring Softmax (#19312)
6 years ago
Zeng Jinle 0a73f7202a
Add retry_allocator for gpu (#19409)
6 years ago
Jacek Czaja ecd9f330c9 [MKL-DNN] Fix to face model on AVX512 platforms (#19282)
6 years ago
liuwei1031 d6cb1a4122
add dynamic C runtime support on windows, test=develop (#19502)
6 years ago
Zeng Jinle c2c5b1b941
remove signal raise msg, test=develop (#19527)
6 years ago
Zeng Jinle caf59d0f3f
Add signal message to stderr (#19421)
6 years ago
Yi Liu efb05ba258
supports multiple NCCL communicators preserved in NCCLCommContext (#19407)
6 years ago
wopeizl b8aa37d529
save the callstack information to file when exception throws test=dev… (#19324)
6 years ago
Tao Luo 6527a7df67
replace part of PADDLE_ASSERT to PADDLE_ENFORCE (#19285)
6 years ago
Yihua Xu b920395842 Use sparse matrix to implement fused emb_seq_pool operator (#19064)
6 years ago
Zeng Jinle 91a0911ca3
Make PADDLE_ENFORCE_EQ support types that cannot be converted to std::string (#19243)
6 years ago
Zeng Jinle 708bd9798d
move_flags_to_unified_files_for_management, test=develop (#19224)
6 years ago
Zeng Jinle 002f325dcd
add PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#19211)
6 years ago
Adam b837689e97 Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
wopeizl 80b7ef6fc8
add tensorrt support for windows (#19084)
6 years ago
Zhang Ting c2063217e7 optimize error message for "embedding" and "cross_entropy" OP (#18765)
6 years ago
liuwei1031 a43a763b54
fix warpctc.dll not found issue (#18761)
6 years ago
Zeng Jinle 08fa98f7cc
Fix gpu_info PADDLE_ENFORCE_GT when fraction_of_gpu_memory_to_use=1.0 (#18950)
6 years ago
Jacek Czaja 5cf2d38594 - Removed passing X from FWD to GRAD via device context (#18911)
6 years ago
Huihuang Zheng ea6ee76fa9
GPU allocation uses fraction of available memory (#18896)
6 years ago
Jacek Czaja cfcb96d2df [MKL-DNN] Fix int8 performance regression (#18758)
6 years ago
Huihuang Zheng 0d3f16f53e
Try to modify external gflags to solve CI compilation (#18872)
6 years ago
Huihuang Zheng cfce4994cf
Merge cuda 9/10 dockerfile with root dockerfile (#18693)
6 years ago
lidanqing 9ecd8ee789 change ComputeINT8 to template version to remove checking dst_datatype code (#18756)
6 years ago
Jacek Czaja 95c1816ec0 [MKL-DNN] Extended LRN with reusing via Acquire API (#18675)
6 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
6 years ago
Jacek Czaja 0d8e6c9b8b MKL-DNN upgrade to 0.20 (#18370)
6 years ago
zhouwei25 772e09560e Optimize the content of error reporting information, print error code and official document web sites (#18671)
6 years ago
Zeng Jinle ae58afc546
Feature/auto_growth_allocator (#18561)
6 years ago
liuwei1031 759530966c
print out error code of cudaGetDeviceProperties if failed (#18643)
6 years ago
Jacek Czaja 71d883b8ef [MKL-DNN] Reimplemented pool2d mkl-dnn to use Acquire API (#18585)
6 years ago
Tao Luo 076f833110
add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580)
6 years ago
gongweibao c0a82748cf
Polish backwards optimizer dependency codes and use more default values. (#18255)
6 years ago
Zeng Jinle be24e5b391
Clean unused code of dim and place (#18565)
6 years ago
Jacek Czaja 8869d7f735 Activations MKLDNN ops refactoring (#18191)
6 years ago
Jiabin Yang 667f88f9a6
Fix/gcc 4.8 ubt link error (#18558)
6 years ago
Physher 0caa08ea40 Add mkldnn int8 mul-op kernel (#17834)
6 years ago
Tao Luo fe32879d2a
add mkldnn shapeblob cache clear strategy (#18513)
6 years ago
chengduo 55baeceddb
Enhance execution error info (#18482)
6 years ago
Tao Luo 3f3112ceb0
add shape_blob for cache mkldnn primitive (#18454)
6 years ago
Leo Zhao 8f5fffca0a rename mkldnn set/get_cur_thread_id() to set/get_cur_mkldnn_session_id() (#18453)
6 years ago
Yi Liu a873fa84ce
supports collective training with programs (#18392)
6 years ago
Brian Liu 4bc2987d2f Fix bug in quantize kernel which cause crash in vgg16/19 model (#17964)
6 years ago
Leo Zhao 681d3553f1 Fix potential mkldnn concat/pool/conv kernel issues (#18393)
6 years ago
HaoRen 9931bc64f5 add dependecy of collective_helper (#18365)
6 years ago
Michał Gallus 8409693272 Reset DeviceContext after quantization warmup (#18182)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
Jacek Czaja c2efdfd5bc [MKL-DNN] Extending reusing to Elementwise_add_mkldnn op (#18146)
6 years ago
chengduo 4978db2c10
Remove nccl dep when the number of GPU is 1 (#18158)
6 years ago