[R 데이터 분석] Using C/C++ in R , R언어에서 C/C++ 사용하기

Resources

C and C++ Basics

These are compiled languages and hence a compiler is needed to transform a human interpret able program into a machine executable one. The most commonly use compiler is gcc. This is already installed on most Linux distributions, including all the servers we have commonly used in this course. If you wish to compile C/C++ programs on your personal computer, you will need to install a compiler.
Mac users should download Xcode from Apple’s app store. Windows users should investigate Rtools or Cygwin.

There are three important differences from R

  • Variables and functions need to be declared before use.
  • Arrays are indexed from zero.
  • Arrays are stored in row-major order.
  • Statements are delimited using semicolons “;”.
  • Loops have minimal overhead and hence generally do not need to be avoided.

C Basics

We will review the following topics from C Tutorial:

  • Program Structure
  • Basic Syntax
  • Data types (Integer and Floating Point Types)
  • Variables
  • Operators (Arithmetic, Relational, Logical and Assignment)
  • Loops
  • Arrays
  • Pointers
  • Type Casting

We will not review these as they work very much like in R:

  • Decision Making

The remaining sections are mostly beyond the scope of what we will discuss here.

Calling C functions from R

We can call a function written in C from R using .C() or .call(). We begin with .C as it is simpler.

Using .C()

The basic recipe for using .C is as follows:

  • Write a function in .C that returns void and stores the desired result in one or more arguments expressly created for this purpose. Your function should have a C_ prefix.

  • Compile your function into a shared library accessible to R using this command: R CMD SHLIB my_func.c with my_func.c replaced by your c program.

  • Within an active R session, link to the shared library using dyn.load("my_func.so").

  • Write an R wrapper to call your C function using .C with the following syntax: .C("C_my_func", arg1, arg2). The reason to use an R wrapper is to ensure that the arguments passed are of the correct size and type in order to avoid potentially fatal errors.

The middle two steps can be avoided by including these functions into an R package. We will discuss this briefly when turning to C++ and Rcpp.

A conceptual key to understanding how .C works is realizing that function arguments are passed as pointers to objects in memory rather than by value.

We will look at some examples of C coding available here.

Using .Call()

We can use .Call() to create and modify R level objects directly using functions in C. R level objects are of type SEXP for “S-expression”. These types are defined in the header file Rinternals.h which should be included in all C functions to be called with .Call().

When we create R level objects, we must protect them from garbage collection using PROTECT or a similar function. We later UNPROTECT them to allow the memory to be reallocated.

Recall that scalars in R are length one vectors. Consequently, the must be coerced to C type scalars to be uses as such. This can be done using the C functions as*asLogicalasIntegerasRealCHAR(asChar()).

경축! 아무것도 안하여 에스천사게임즈가 새로운 모습으로 재오픈 하였습니다.
어린이용이며, 설치가 필요없는 브라우저 게임입니다.
https://s1004games.com

Here is a simple example using the inline::cfunction.

library(inline)
## 
## Attaching package: 'inline'
## The following object is masked from 'package:Rcpp':
## 
##     registerPlugin
seqn = cfunction(signature(n_="integer", start_="integer"), 
 body="
 int i, n = asInteger(n_), start=asInteger(start_);
 SEXP out = PROTECT(allocVector(INTSXP, n));
 for(i=0; i<n; ++i){
   INTEGER(out)[i]=i+start;
 }
 UNPROTECT(1);
 return out;
")
seqn(8, 3)
## [1]  3  4  5  6  7  8  9 10
fib = cfunction(c(n_="integer"), body=
  "
  /* Declaratations */
  int n = asInteger(n_);
  SEXP out = PROTECT(allocVector(INTSXP, n));

  /* Function Body */
  INTEGER(out)[0] = 1;
  if(n > 1)  INTEGER(out)[1]=1;
  if(n > 2)
    {
      for(int i=2; i<n; i++)
        {
        INTEGER(out)[i] = INTEGER(out)[i-2] + INTEGER(out)[i-1];
        }
    }

  /* Clean up and return */
  UNPROTECT(1);
  return out;
  ")
fib(10)
##  [1]  1  1  2  3  5  8 13 21 34 55

Modifying inputs

Be careful about modifying input arguments without duplicating first as other R objects may point to the same location in memory owing to “copy-on-modify” semantics.
Here is a quick example from p. 441 of Advanced R using the “inline” package.

add_three = cfunction(c(x="numeric"),
  "
   REAL(x)[0] = REAL(x)[0] + 3;
   return x; 
  ")
y <- x <- 1
add_three(x)
## [1] 4
x
## [1] 4
y
## [1] 4

Here is a corrected version in which the argument is duplicated rather than modified in place.

add_3 = cfunction(c(x_="numeric"),
  "
   SEXP x = PROTECT(duplicate(x_));
   REAL(x)[0] = REAL(x)[0] + 3;
   UNPROTECT(1);
   return x; 
  ")
y <- x <- 1
add_3(x)
## [1] 4
x
## [1] 1
y
## [1] 1

Interfacing with C++ via Rcpp

The Rcpp package greatly simplifies the process of exposing functions written in C++ to R.

Using sourceCpp

We can use the function sourceCpp to read a function written in C++ into R interactively. The function takes care of the compilation using R CMD SHLIB and automatically generates an R wrapper for the underlying function. The shared library and other files will be written to the directory specified by cacheDir which defaults to a temporary directory for automated clean up.

To use a C++ function via sourceCpp:

  1. Write a your C++ function in a file with extension .cpp.
  2. In the source file, be sure to #include <Rcpp.h>.
  3. Designate functions exposed to R using the tag // [[Rcpp::export]].
  4. Compile and source your function using sourceCpp() and a link to the file.

Here are some minimal examples based on generating Fibonacci numbers. You can download the source code here.

The final Fibonacci example here makes two changes from the previous example:

  1. We add the line using namespace rcpp; to avoid having to type the prefix Rcpp:: when referring to the Rcpp namespace. This is much like using library() within R to add a package to the search path.

  2. We show how to print to the R console from within the C++ function using Rcout. This can be useful for manual debugging.

#include <Rcpp.h>
using namespace Rcpp;

// [[Rcpp::export]]
IntegerVector FibCpp2(int n)
{
  // Error checking
  if(n <= 0)
    {
      throw std::range_error("n must be a positive integer");
    }

  // Allocate memory
  IntegerVector out(n);
  out[0]=1;

  Rcout << "Starting computations ... ";
  // Compute additional terms
  if(n > 0)
    {
      out[1]=1;
      int i;
      for(i=2; i<n; i++)
        {
          out[i] = out[i-1] + out[i-2];
        }
    }

  Rcout << "done." << std::endl;
  return out;
}
sourceCpp('~/Stats506/FibCPP2.cpp')
FibCpp2(10)
## Starting computations ... done.
##  [1]  1  1  2  3  5  8 13 21 34 55

Example

Here is another simple example.

Building a package with Rcpp

The sourceCpp function is convenient for interactive use and initial development. However, for larger projects with multiple functions it is better to maintain the code base using an R package. Rcpp automates much of the work involved in doing this via the compileAttributes function.

This function creates a C++ wrapper to any C++ functions tagged with the // [[Rcpp::export]] attribute that handles type conversion and an R wrapper that calls this function in turn. This makes for a relatively seamless process in which you only occasionally need to look into the translation details.

In the download above is avery simple R package Pkg with the getRegion function from the previous example.

[출처] https://jbhender.github.io/Stats506/F17/Using_C_Cpp.html

 

본 웹사이트는 광고를 포함하고 있습니다.
광고 클릭에서 발생하는 수익금은 모두 웹사이트 서버의 유지 및 관리, 그리고 기술 콘텐츠 향상을 위해 쓰여집니다.
번호 제목 글쓴이 날짜 조회 수
공지 오라클 기본 샘플 데이터베이스 졸리운_곰 2014.01.02 25085
공지 [SQL컨셉] 서적 "SQL컨셉"의 샘플 데이타 베이스 SAMPLE DATABASE of ORACLE 가을의 곰을... 2013.02.10 24564
공지 [G_SQL] Sample Database 가을의 곰을... 2012.05.20 25943
78 [Kafka] Kafka 한번 살펴보자... Quickstart file 졸리운_곰 2021.06.18 27
77 Java Kafka Producer, Consumer 예제 구현 Java를 이용하여 Kafka Producer와 Kakfa Consumer를 구현해보자. file 졸리운_곰 2021.06.18 105
76 Beginner’s Guide to Understand Kafka file 졸리운_곰 2021.06.18 23
75 [Kafka] Kafka 설치/실행 및 테스트 file 졸리운_곰 2021.06.18 36
74 [java] [kafka] [Kafka] 개념 및 기본예제 file 졸리운_곰 2021.06.16 115
73 Getting started with Apache Kafka in Python file 졸리운_곰 2020.09.10 58
72 [Kafka] 다운로드 및 Quick Start file 졸리운_곰 2020.09.07 35
71 [Kafka] 기본 개념잡기 file 졸리운_곰 2020.09.07 29
70 Flume Integration with Kafka file 졸리운_곰 2019.04.16 103
69 빅데이터: 플럼(Flume) 토폴로지 설계 file 졸리운_곰 2019.04.16 240
68 실시간 처리를 위한 분산 메시징 시스템 카프카(Kafka) file 졸리운_곰 2018.05.12 199
67 Flume과 Kafka를 사용한 초당 100만개 로그 수집 테스트 file 졸리운_곰 2018.05.12 232
66 웹 크롤링 / web crwaling / web scraping / 웹 스크래핑 file 졸리운_곰 2017.07.09 855
65 빅데이터 단지 몇퍼센트의 예측 정확성을 위하여 장애로 가득찬 빅데이터 시스템을 도입하여야 하는가에 대한 의문! file 졸리운_곰 2017.03.20 156
64 빅데이터: 플럼(Flume) 토폴로지 설계 file 졸리운_곰 2017.03.20 129
63 [실시간 분석 시스템] Apache Flume를 활용한 데이터 수집(1) file 졸리운_곰 2017.03.06 111
62 [실시간 분석 시스템] 데이터 수집 #2 Apache Sqoop을 활용하여 RDBMS 데이터 수집(2) file 졸리운_곰 2017.03.06 289
61 [실시간 분석 시스템] 데이터 수집 #2 Apache Sqoop을 활용하여 RDBMS 데이터 수집(1) file 졸리운_곰 2017.03.06 144
60 [실시간 분석 시스템] 데이터 수집 #1 오픈 소스 수집기 비교 file 졸리운_곰 2017.03.06 239
59 [실시간 분석 시스템] 일단 데이터 들여다 보기 file 졸리운_곰 2017.03.06 124
대표 김성준 주소 : 경기 용인 분당수지 U타워 등록번호 : 142-07-27414
통신판매업 신고 : 제2012-용인수지-0185호 출판업 신고 : 수지구청 제 123호 개인정보보호최고책임자 : 김성준 sjkim70@stechstar.com
대표전화 : 010-4589-2193 [fax] 02-6280-1294 COPYRIGHT(C) stechstar.com ALL RIGHTS RESERVED