clang/clang.git - (deprecated) Vendor branches of https://git.llvm.org/git/clang.git , use clang/llvm-project instead.

	Commit message (Collapse)	Author	Age	Files	Lines
*	[CUDA][Clang][Bugfix] Add missing CUDA 9.2 case	Gheorghe-Teodor Bercea	2019-05-03	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The bug was reported on the OpenMP-dev list: .../obj-release/lib/clang/9.0.0/include/__clang_cuda_intrinsics.h:173:35: error: '__nvvm_shfl_sync_idx_i32' needs target feature ptx60\|ptx61\|ptx63\|ptx64 __MAKE_SYNC_SHUFFLES(__shfl_sync, __nvvm_shfl_sync_idx_i32, This problem occurs when trying to compile a .cu file that requires a newer ptx version (>ptx60 in this case) than ptx42. Reviewers: tra, ABataev, caomhin Reviewed By: tra Subscribers: jdoerfert, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D61474 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@359910 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Do not pass deprecated option fo fatbinary	Artem Belevich	2019-05-02	1	-1/+2
\| \| \| \| \| \| \| \| \|	CUDA 10.1 tools deprecated some command line options. fatbinary no longer needs --cuda. Differential Revision: https://reviews.llvm.org/D61470 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@359838 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Implemented _[bi]mma* builtins.	Artem Belevich	2019-04-25	1	-13/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	These builtins provide access to the new integer and sub-integer variants of MMA (matrix multiply-accumulate) instructions provided by CUDA-10.x on sm_75 (AKA Turing) GPUs. Also added a feature for PTX 6.4. While Clang/LLVM does not generate any PTX instructions that need it, we still need to pass it through to ptxas in order to be able to compile code that uses the new 'mma' instruction as inline assembly (e.g used by NVIDIA's CUTLASS library https://github.com/NVIDIA/cutlass/blob/master/cutlass/arch/mma.h#L101) Differential Revision: https://reviews.llvm.org/D60279 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@359248 91177308-0d34-0410-b5e6-96231b3b80d8
*	Basic CUDA-10 support.	Artem Belevich	2019-02-05	1	-0/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D57771 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@353232 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Propagate detected version of CUDA to cc1	Artem Belevich	2019-01-31	1	-1/+5
\| \| \| \| \| \| \| \| \| \| \| \| \|	..and use it to control that parts of CUDA compilation that depend on the specific version of CUDA SDK. This patch has a placeholder for a 'new launch API' support which is in a separate patch. The list will be further extended in the upcoming patch to support CUDA-10.1. Differential Revision: https://reviews.llvm.org/D57487 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@352798 91177308-0d34-0410-b5e6-96231b3b80d8
*	Update the file headers across all of the LLVM projects in the monorepo	Chandler Carruth	2019-01-19	1	-4/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to reflect the new license. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@351636 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA][OPENMP][NVPTX]Improve logic of the debug info support.	Alexey Bataev	2018-12-12	1	-26/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Added support for the -gline-directives-only option + fixed logic of the debug info for CUDA devices. If optimization level is O0, then options --[no-]cuda-noopt-device-debug do not affect the debug info level. If the optimization level is >O0, debug info options are used + --no-cuda-noopt-device-debug is used or no --cuda-noopt-device-debug is used, the optimization level for the device code is kept and the emission of the debug directives is used. If the opt level is > O0, debug info is requested + --cuda-noopt-device-debug option is used, the optimization is disabled for the device code + required debug info is emitted. Reviewers: tra, echristo Subscribers: aprantl, guansong, JDevlieghere, cfe-commits Differential Revision: https://reviews.llvm.org/D51554 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@348930 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Fix nvidia-cuda-toolkit detection on Ubuntu	Joel E. Denny	2018-12-06	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	This just extends D40453 (r319317) to Ubuntu. Reviewed By: Hahnfeld, tra Differential Revision: https://reviews.llvm.org/D55269 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@348504 91177308-0d34-0410-b5e6-96231b3b80d8
*	Lift VFS from clang to llvm (NFC)	Jonas Devlieghere	2018-10-10	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch moves the virtual file system form clang to llvm so it can be used by more projects. Concretely the patch: - Moves VirtualFileSystem.{h\|cpp} from clang/Basic to llvm/Support. - Moves the corresponding unit test from clang to llvm. - Moves the vfs namespace from clang::vfs to llvm::vfs. - Formats the lines affected by this change, mostly this is the result of the added llvm namespace. RFC on the mailing list: http://lists.llvm.org/pipermail/llvm-dev/2018-October/126657.html Differential revision: https://reviews.llvm.org/D52783 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@344140 91177308-0d34-0410-b5e6-96231b3b80d8
*	[HIP] Support early finalization of device code for -fno-gpu-rdc	Yaxun Liu	2018-10-02	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch renames -f{no-}cuda-rdc to -f{no-}gpu-rdc and keeps the original options as aliases. When -fgpu-rdc is off, clang will assume the device code in each translation unit does not call external functions except those in the device library, therefore it is possible to compile the device code in each translation unit to self-contained kernels and embed them in the host object, so that the host object behaves like usual host object which can be linked by lld. The benefits of this feature is: 1. allow users to create static libraries which can be linked by host linker; 2. amortized device code linking time. This patch modifies HIP action builder to insert actions for linking device code and generating HIP fatbin, and pass HIP fatbin to host backend action. It extracts code for constructing command for generating HIP fatbin as a function so that it can be reused by early finalization. It also modifies codegen of HIP host constructor functions to embed the device fatbin when it is available. Differential Revision: https://reviews.llvm.org/D52377 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@343611 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Improve search for libomptarget-nvptx	Jonas Hahnfeld	2018-09-27	1	-6/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When looking for the bclib Clang considered the default library path first while it preferred directories in LIBRARY_PATH when constructing the invocation of nvlink. The latter actually makes more sense because during development it allows using a non-default runtime library. So change the search for the bclib to start looking in directories given by LIBRARY_PATH. Additionally add a new option --libomptarget-nvptx-path= which will be searched first. This will be handy for testing purposes. Differential Revision: https://reviews.llvm.org/D51686 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@343230 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Added basic support for compiling with CUDA-10.0	Artem Belevich	2018-09-24	1	-1/+6
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@342924 91177308-0d34-0410-b5e6-96231b3b80d8
*	Rename -mlink-cuda-bitcode to -mlink-builtin-bitcode	Matt Arsenault	2018-08-20	1	-2/+2
\| \| \| \| \| \| \|	The same semantics work for OpenCL, and probably any offload language. Keep the old name around as an alias. git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@340193 91177308-0d34-0410-b5e6-96231b3b80d8
*	[DEBUGINFO] Disable unsupported debug info options for NVPTX target.	Alexey Bataev	2018-07-27	1	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Some targets support only default set of the debug options and do not support additional debug options, like NVPTX target. Patch introduced virtual function supportsDebugInfoOptions() that can be overloaded by the toolchain, checks if the target supports some debug options and emits warning when an unsupported debug option is found. Reviewers: echristo Subscribers: aprantl, JDevlieghere, cfe-commits Differential Revision: https://reviews.llvm.org/D49148 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@338155 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Fixed the list of GPUs supported by CUDA-9.	Artem Belevich	2018-05-23	1	-2/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D47268 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@333098 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Added -f[no-]cuda-short-ptr option	Artem Belevich	2018-05-09	1	-2/+4
\| \| \| \| \| \| \| \| \|	The option enables use of 32-bit pointers for accessing const/local/shared memory. The feature is disabled by default. Differential Revision: https://reviews.llvm.org/D46148 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@331938 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Enable CUDA compilation with CUDA-9.2	Artem Belevich	2018-04-24	1	-0/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D45827 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@330753 91177308-0d34-0410-b5e6-96231b3b80d8
*	[NVPTX, CUDA] Added support for m8n32k16 and m32n8k16 variants of wmma ↵	Artem Belevich	2018-04-18	1	-10/+12
\| \| \| \| \| \| \| \| \| \|	instructions. The new instructions were added added for sm_70+ GPUs in CUDA-9.1. Differential Revision: https://reviews.llvm.org/D45068 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@330296 91177308-0d34-0410-b5e6-96231b3b80d8
*	[NVPTX] Emit debug info in DWARF-2 by default for Cuda devices.	Alexey Bataev	2018-04-18	1	-3/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: NVPTX target supports debug info in DWARF-2 format. Patch adds emission of debug info in DWARF-2 by default. Reviewers: tra, jlebar Subscribers: aprantl, JDevlieghere, cfe-commits Differential Revision: https://reviews.llvm.org/D42581 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@330272 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Added --[no-]cuda-include-ptx=sm_XX\|all option.	Artem Belevich	2018-04-10	1	-0/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently we always include PTX into the fatbin along with the GPU code.It about doubles the size of the GPU binary we need to carry in the executable. These options allow control inclusion of PTX into GPU binary. This patch does not change the defaults, though we may consider making no-PTX the default in the future. Differential Revision: https://reviews.llvm.org/D45495 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@329737 91177308-0d34-0410-b5e6-96231b3b80d8
*	Fix typos in clang	Alexander Kornienko	2018-04-06	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Found via codespell -q 3 -I ../clang-whitelist.txt Where whitelist consists of: archtype cas classs checkk compres definit frome iff inteval ith lod methode nd optin ot pres statics te thru Patch by luzpaz! (This is a subset of D44188 that applies cleanly with a few files that have dubious fixes reverted.) Differential revision: https://reviews.llvm.org/D44188 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@329399 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Add flag for linking runtime bitcode library	Gheorghe-Teodor Bercea	2018-03-13	1	-0/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch adds an additional flag to the OpenMP device offloading toolchain to link in the runtime library bitcode. Reviewers: Hahnfeld, ABataev, carlo.bertolli, caomhin, grokos, hfinkel Reviewed By: ABataev, grokos Subscribers: jholewinski, guansong, cfe-commits Differential Revision: https://reviews.llvm.org/D43197 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@327460 91177308-0d34-0410-b5e6-96231b3b80d8
*	Revert revision 327438.	Gheorghe-Teodor Bercea	2018-03-13	1	-38/+0
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@327447 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Add flag for linking runtime bitcode library	Gheorghe-Teodor Bercea	2018-03-13	1	-0/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch adds an additional flag to the OpenMP device offloading toolchain to link in the runtime library bitcode. Reviewers: Hahnfeld, ABataev, carlo.bertolli, caomhin, grokos, hfinkel Reviewed By: ABataev, grokos Subscribers: jholewinski, guansong, cfe-commits Differential Revision: https://reviews.llvm.org/D43197 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@327438 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Add option to generate relocatable device code	Jonas Hahnfeld	2018-02-12	1	-5/+15
\| \| \| \| \| \| \| \| \| \| \|	As a first step, pass '-c/--compile-only' to ptxas so that it doesn't complain about references to external function. This will successfully generate object files, but they won't work at runtime because the registration routines need to adapted. Differential Revision: https://reviews.llvm.org/D42921 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@324878 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Detect installation in PATH	Jonas Hahnfeld	2018-01-31	1	-15/+50
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the CUDA toolkit is not installed to its default locations in /usr/local/cuda, the user is forced to specify --cuda-path. This is tedious and the driver can be smarter if well-known tools (like ptxas) can already be found in the PATH environment variable. Add option --cuda-path-ignore-env if the user wants to ignore set environment variables. Also use it in the tests to make sure the driver always finds the same CUDA installation, regardless of the user's environment. Differential Revision: https://reviews.llvm.org/D42642 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@323848 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Added partial support for CUDA-9.1	Artem Belevich	2018-01-30	1	-5/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Clang can use CUDA-9.1 now, though new APIs (are not implemented yet. The major change is that headers in CUDA-9.1 went through substantial changes that started in CUDA-9.0 which required substantial changes in the cuda compatibility headers provided by clang. There are two major issues: * CUDA SDK no longer provides declarations for libdevice functions. * A lot of device-side functions have become nvcc's builtins and CUDA headers no longer contain their implementations. This patch changes the way CUDA headers are handled if we compile with CUDA 9.x. Both 9.0 and 9.1 are affected. * Clang provides its own declarations of libdevice functions. * For CUDA-9.x clang now provides implementation of device-side 'standard library' functions using libdevice. This patch should not affect compilation with CUDA-8. There may be some observable differences for CUDA-9.0, though they are not expected to affect functionality. Tested: CUDA test-suite tests for all supported combinations of: CUDA: 7.0,7.5,8.0,9.0,9.1 GPU: sm_20, sm_35, sm_60, sm_70 Differential Revision: https://reviews.llvm.org/D42513 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@323713 91177308-0d34-0410-b5e6-96231b3b80d8
*	Fix function call to fix build	Ismail Donmez	2017-11-29	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	../tools/clang/lib/Driver/ToolChains/Cuda.cpp:80:18: error: reference to non-static member function must be called; did you mean to call it with no arguments? if (Distro(D.getVFS).IsDebian()) ~~^~~~~~ () git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@319322 91177308-0d34-0410-b5e6-96231b3b80d8
*	Follow up of r319317, add the missing header file	Sylvestre Ledru	2017-11-29	1	-0/+1
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@319319 91177308-0d34-0410-b5e6-96231b3b80d8
*	Add the nvidia-cuda-toolkit Debian package path to search path	Sylvestre Ledru	2017-11-29	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Reported here: http://bugs.debian.org/882505 Patch by Andreas Beckmann Reviewers: Hahnfeld, tra Reviewed By: tra Subscribers: jlebar, cfe-commits Differential Revision: https://reviews.llvm.org/D40453 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@319317 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Consistently use cubin extension for nvlink	Jonas Hahnfeld	2017-11-21	1	-9/+17
\| \| \| \| \| \| \| \| \| \| \| \|	This was previously done in some places, but for example not for bundling so that single object compilation with -c failed. In addition cubin was used for all file types during unbundling which is incorrect for assembly files that are passed to ptxas. Tighten up the tests so that we can't regress in that area. Differential Revision: https://reviews.llvm.org/D40250 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@318763 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Print an error if you try to compile with < sm_30 on CUDA 9.	Justin Lebar	2017-10-25	1	-7/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: CUDA 9's minimum sm is sm_30. Ideally we should also make sm_30 the default when compiling with CUDA 9, but that seems harder than it should be. Subscribers: sanjoy Differential Revision: https://reviews.llvm.org/D39109 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@316611 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CMake][OpenMP] Customize default offloading arch	Jonas Hahnfeld	2017-10-17	1	-10/+5
\| \| \| \| \| \| \| \| \|	For the shuffle instructions in reductions we need at least sm_30 but the user may want to customize the default architecture. Differential Revision: https://reviews.llvm.org/D38883 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@315996 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Require libdevice only if needed	Jonas Hahnfeld	2017-10-16	1	-13/+4
\| \| \| \| \| \| \| \| \|	If the user passes -nocudalib, we can live without it being present. Simplify the code by just checking whether LibDeviceMap is empty. Differential Revision: https://reviews.llvm.org/D38901 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@315902 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Don't throw cudalib not found error if only front-end is required.	Gheorghe-Teodor Bercea	2017-09-26	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If we only use the compiler front-end, do not throw an error about the cuda device library not being found. This allows the front-end to be run on systems where no Cuda installation is found. Reviewers: Hahnfeld, ABataev, carlo.bertolli, caomhin, tra Reviewed By: tra Subscribers: hfinkel, tra, cfe-commits Differential Revision: https://reviews.llvm.org/D37914 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@314217 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Enable the existing nocudalib flag for OpenMP offloading toolchain.	Gheorghe-Teodor Bercea	2017-09-25	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Enable the -nocudalib flag for the OpenMP device offloading toolchain as well. Currently it can only be used for the CUDA toolchain. Reviewers: Hahnfeld, ABataev, carlo.bertolli, caomhin, hfinkel, tra Reviewed By: tra Subscribers: hfinkel, cfe-commits Differential Revision: https://reviews.llvm.org/D37913 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@314164 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Bugfix: output file name drops the absolute path where full path is ↵	Gheorghe-Teodor Bercea	2017-09-25	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	needed. Summary: When composing the output file name, the path to the file is being dropped. The full path is required. Reviewers: Hahnfeld, ABataev, caomhin, carlo.bertolli, hfinkel, tra Reviewed By: tra Subscribers: hfinkel, tra, cfe-commits Differential Revision: https://reviews.llvm.org/D37912 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@314156 91177308-0d34-0410-b5e6-96231b3b80d8
*	Revert commit with wrong message.	Gheorghe-Teodor Bercea	2017-09-25	1	-1/+1
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@314154 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Don't throw cudalib not found error if only front-end is required.	Gheorghe-Teodor Bercea	2017-09-25	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If we only use the compiler front-end, do not throw an error about the cuda device library not being found. This allows the front-end to be run on systems where no Cuda installation is found. Reviewers: Hahnfeld, ABataev, carlo.bertolli, caomhin, tra Reviewed By: tra Subscribers: hfinkel, tra, cfe-commits Differential Revision: https://reviews.llvm.org/D37914 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@314150 91177308-0d34-0410-b5e6-96231b3b80d8
*	[NVPTX] Implemented shfl.sync instruction and supporting intrinsics/builtins.	Artem Belevich	2017-09-20	1	-5/+11
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D38090 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@313820 91177308-0d34-0410-b5e6-96231b3b80d8
*	[CUDA] Added rudimentary support for CUDA-9 and sm_70.	Artem Belevich	2017-09-07	1	-37/+51
\| \| \| \| \| \| \| \| \| \| \| \| \|	For now CUDA-9 is not included in the list of CUDA versions clang searches for, so the path to CUDA-9 must be explicitly passed via --cuda-path=. On LLVM side NVPTX added sm_70 GPU type which bumps required PTX version to 6.0, but otherwise is equivalent to sm_62 at the moment. Differential Revision: https://reviews.llvm.org/D37576 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@312734 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Enable previously successful offloading tests.	Gheorghe-Teodor Bercea	2017-08-11	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Create a separate test file to contain all tests for OpenMP offloading to GPUs. Make libdevice checking more robust by accounting for the case in which no libdevice is found. This changes are in connrection with diff: D29660 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@310718 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Delete tests in openmp-offload.c which cuase failures	Gheorghe-Teodor Bercea	2017-08-10	1	-3/+7
\| \| \| \| \| \| \| \| \| \|	until a better way to perform these tests is figured out. Change connected to diff: D29654 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@310625 91177308-0d34-0410-b5e6-96231b3b80d8
*	Revert r310489 and follow-up commits r310505, r310519, r310537 and r310549	Alex Lorenz	2017-08-10	1	-14/+4
\| \| \| \| \| \| \| \| \| \| \|	Commit r310489 caused 'openmp-offload.c' test failures on Darwin and other platforms: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental_check/39230/testReport/junit/Clang/Driver/openmp_offload_c/ The follow-up commits tried to fix the test, but the test is still failing. git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@310580 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Provide a default GPU arch that is supported by	Gheorghe-Teodor Bercea	2017-08-10	1	-3/+7
\| \| \| \| \| \| \| \| \| \|	the underlying hardware. This fixes a bug triggered by diff: D29660 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@310549 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Enable executable lookup into driver directory.	Gheorghe-Teodor Bercea	2017-08-09	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Invoking the compiler inside a script causes the clang-offload-bundler executable to not be found. This patch enables the lookup for executables in the driver directory where the clang-offload-bundler resides. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, ABataev, caomhin Reviewed By: hfinkel Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D36537 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@310513 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Add flag for overwriting default PTX version for OpenMP targets	Gheorghe-Teodor Bercea	2017-08-09	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This flag "--fopenmp-ptx=" enables the overwriting of the default PTX version used for GPU offloaded OpenMP target regions: "+ptx42". Reviewers: arpith-jacob, caomhin, carlo.bertolli, ABataev, Hahnfeld, jlebar, hfinkel, tstellar Reviewed By: ABataev Subscribers: rengolin, cfe-commits Differential Revision: https://reviews.llvm.org/D29660 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@310489 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Add flag for disabling the default generation of relocatable OpenMP ↵	Gheorghe-Teodor Bercea	2017-08-09	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	target code for NVIDIA GPUs. Summary: Previously we have added the "-c" flag which gets passed to PTXAS by default to generate relocatable OpenMP target code by default. This set of flags exposes control over this behaviour. Reviewers: arpith-jacob, caomhin, carlo.bertolli, ABataev, Hahnfeld, jlebar, hfinkel, tstellar Reviewed By: ABataev Subscribers: Hahnfeld, rengolin, cfe-commits Differential Revision: https://reviews.llvm.org/D29659 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@310484 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] Make OpenMP generated code for the NVIDIA device relocatable by default	Gheorghe-Teodor Bercea	2017-08-09	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \|	Original Diff: D29642 This patch was previously reverted due to an error with patch D29654 that this depends on. git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@310479 91177308-0d34-0410-b5e6-96231b3b80d8
*	[OpenMP] OpenMP device offloading code generation produces a cubin file ↵	Gheorghe-Teodor Bercea	2017-08-08	1	-3/+88
\| \| \| \| \| \| \| \| \| \|	which is then integrated in the host binary using the host linker. Diff: D29654 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@310362 91177308-0d34-0410-b5e6-96231b3b80d8