Commit Graph

698 Commits (40885c225b61fb6d7a413f6ef2a70ee73c3e995c)

Author SHA1 Message Date
Jacek Czaja 84bb45c054 [MKL-DNN] Thread-Safety for MKL-DNN reusing Part 1 (#17965)
6 years ago
hutuxian 969e6378b9
Pipeline Concurrency (#17402)
6 years ago
Zeng Jinle 3ece61f71e
Remove attribute in Allocator::Allocate (#17878)
6 years ago
Zeng Jinle 3925bd81e8
Fix cuda/cudnn version detection error (#17853)
6 years ago
chengduo d1169afaa3
remove InstallFailureSignalHandler (#17828)
6 years ago
Leo Zhao 50326563d5 enable mkldnn primitive reuse for platform reorder (#17826)
6 years ago
wangchaochaohu c10157a5df
revise the cudnn conv choose algorithm to improve the performance(mask rcnn benchmark) (#17753)
6 years ago
chengduo 863c75168c
polish error doc (#17772)
6 years ago
gongweibao 0d561ef442
fix 2dconn test=develop (#17681)
6 years ago
gongweibao 65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. (#17263)
6 years ago
wopeizl 6724a652f3
add __str__ method for tensor and lodtensor to support print test=dev… (#17588)
6 years ago
mozga-intel f2694e122d [NGraph] Enable assign operator for a ngraph, test=develop (#17437)
6 years ago
Zeng Jinle c6189637cd
Fix allocator bug (#16712)
6 years ago
mozga-intel 109b5aed5a [NGraph] Enable reshape operator test=develop (#17512)
6 years ago
guomingz 2281ebf0f3 Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130)
6 years ago
qingqing01 97f0ec2357 Fix compiling error with cuDNN 5.1 (#17458)
6 years ago
Zeng Jinle eab34b2df6
fix_dygraph_mem_leak, test=develop (#17396)
6 years ago
qingqing01 e32c9888f5
Double backward of conv2d. (#17211)
6 years ago
zhaoyuchen2018 792443ef23
Refine elementwise kernel. (#16952)
6 years ago
chengduo db5e74ab95
update assert (#17282)
6 years ago
baojun 7bd1d03ee5 Adding lrn op for ngraph engine (#17189)
6 years ago
Tao Luo ff1661f12a
remove unused FLAGS_warpctc_dir (#17162)
6 years ago
Huihuang Zheng e4a5332416
Fix a typo in gpu_info.cc (#17175)
6 years ago
Huihuang Zheng b9494058b3
Use CudnnWorkspaceHandle in exhaustive search (#17082)
6 years ago
Zeng Jinle 0c335dcd2c
Make conv cudnn workspace size configurable (#17036)
6 years ago
Zeng Jinle 1202d3fc74
Refine model gpu memory (#16993)
6 years ago
gongweibao cbdb8a17b1
Polish DGC code (#16818)
6 years ago
xuezhong 742d758747 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_infershape_bug2
6 years ago
xuezhong 5663fbfb0a fix infershape bug
6 years ago
Jacek Czaja 87a44b1149 [MKL-DNN] Added reusing of primitive descriptors (fp32) (#16667)
6 years ago
dongdaxiang a659b37ace make lodtensor_printer usable in gpu setting
6 years ago
Chen Weihang 0b2aec14b6 Revert "Model data cryption link all lib (#16555)"
6 years ago
Chen Weihang c38c7c5619
Model data cryption link all lib (#16555)
6 years ago
guru4elephant 76b49f02ee
Merge pull request #16539 from guru4elephant/train_with_pipe_reader_merge_develop
6 years ago
gongweibao fea91164b7 Fix windows compilation error! (#16546)
6 years ago
dongdaxiang 3a79be6eb3 refine API spec
6 years ago
dongdaxiang 98dda08a85 fix pull sparse slow problem
6 years ago
dongdaxiang 93c3c7f9b3 fix dataset testcase problem
6 years ago
dongdaxiang d739bab844 fix async_executor problem and remove some unnecessary testcase, fix trainer_desc import problem
6 years ago
dongdaxiang e3107a6ae0 fix windows compile problem
6 years ago
dongdaxiang 398004ece0 disable sys/wait.h to fix windows compile problem, include scope in lodtensor_printer
6 years ago
dongdaxiang 39362a8415 move root_scope->DropKids() into Finalize() so that we do not have to drop all the kids
6 years ago
dongdaxiang a0b59773af fix code style
6 years ago
dongdaxiang 365be5d559 support win32 flag in io.cc shell.cc, fix code style problem in fleet_wrapper, fix lodtensor_printer_test problem
6 years ago
dongdaxiang dc8cf36e4b add more example on datagenerator
6 years ago
dongdaxiang 6bf796df14 refine print fetch list
6 years ago
dongdaxiang cf1360643f add printer for fetch variable
6 years ago
Jacek Czaja 2632327429 [MKL-DNN] Tensor modifications revert (#16462)
6 years ago
Zeng Jinle 69cb9792ea
Merge pull request #16506 from sneaxiy/revert-16424-fix_allocator_bug
6 years ago
sneaxiy 5656fa9f7c fix travis ci
6 years ago