RstatisTik/RstatisTikPortal/RcourSe/CourseOutline/FunctionsInR/ApplyR

Revision 6, last edited 2015-05-01 10:48:36 by mandy.vogel@googlemail.com

Introduction

Every function in R has three important characteristics:

 * a body (the code inside the function), returned by body()
 * arguments (the list of arguments which controls how you can call the function), returned by formals()
 * an environment (the "map" of the location of the function's variables), returned by environment()

You can see all three parts if you type the name of the function without brackets. The exceptions are primitives: primitive functions, like sum(), call C code directly with .Primitive() and contain no R code. Therefore their formals(), body(), and environment() are all NULL.
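The three parts described above can be inspected directly. The following is a minimal sketch; the function add_scaled() is a hypothetical example and not part of the course material.

```r
# A small user-defined function (hypothetical, for illustration only)
add_scaled <- function(x, y = 2) {
  x + 10 * y
}

formals(add_scaled)      # the argument list: x (no default) and y = 2
body(add_scaled)         # the code inside the function
environment(add_scaled)  # the environment in which the function was defined

# Primitive functions contain no R code, so all three parts are NULL
is.primitive(sum)
formals(sum)
body(sum)
environment(sum)
```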

Functions

Function Arguments

Arguments are matched first by exact name (perfect matching), then by prefix matching, and finally by position. By default, R function arguments are lazy: they are evaluated only if they are actually used.
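The matching order and lazy evaluation can be tried out with two small functions. This is only a sketch; the names f, g, alpha, beta and gamma are assumptions chosen for illustration.

```r
# Hypothetical function that simply returns its arguments by name
f <- function(alpha, beta, gamma) {
  c(alpha = alpha, beta = beta, gamma = gamma)
}

f(1, 2, 3)            # all arguments matched by position
f(gamma = 3, 1, 2)    # exact name first, the rest by position
f(ga = 3, be = 2, 1)  # prefix matching, the remaining value by position

# Lazy evaluation: b is never used, so its expression is never evaluated
g <- function(a, b) {
  a * 2
}
g(5, stop("this error is never raised"))
```

All three calls to f() return the same named vector, and the call to g() finishes without an error because b is never touched.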

Implicit Loops

Introduction

A common application of loops is to apply a function to each element of a set of values and collect the results in a single structure. In R this is mainly done by the higher-order functions:

 * lapply()
 * sapply()
 * apply()
 * tapply()
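To see what "implicit loop" means, it may help to compare an explicit for loop with the equivalent sapply() call. The vector values is an assumption made up for this sketch.

```r
values <- c(1, 4, 9, 16)

# Explicit loop: allocate the result, then fill it element by element
roots_loop <- numeric(length(values))
for (i in seq_along(values)) {
  roots_loop[i] <- sqrt(values[i])
}

# Implicit loop: sapply() applies sqrt() to each element and collects the results
roots_apply <- sapply(values, sqrt)

identical(roots_loop, roots_apply)
```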

lapply()

The functions lapply() and sapply() are similar: their first argument can be a list, data frame, matrix or vector, and the second argument is the function to "apply". The former returns a list (hence the "l") and the latter tries to simplify the result (hence the "s").
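The difference in return type shows up on a small list. The list scores is a hypothetical example, not data from the course.

```r
# Hypothetical list of numeric vectors
scores <- list(a = 1:5, b = c(2, 4, 6), c = 10)

means_list <- lapply(scores, mean)  # a list with elements a, b and c
means_vec  <- sapply(scores, mean)  # simplified to a named numeric vector

str(means_list)
str(means_vec)
```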

apply()

The function apply() can be applied to an array. Its first argument is the array, the second is the dimension or dimensions over which the function is applied, and the third is the function.
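A short sketch of the three arguments in use, on a small matrix. The name mat and its contents are assumptions for illustration.

```r
# Hypothetical 2 x 3 matrix, filled column by column
mat <- matrix(1:6, nrow = 2)

row_sums <- apply(mat, 1, sum)  # dimension 1: apply sum() to each row
col_sums <- apply(mat, 2, sum)  # dimension 2: apply sum() to each column

row_sums
col_sums
```

Here row_sums has one value per row (two values) and col_sums one value per column (three values).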

tapply()

The function tapply() allows you to create tables (hence the "t") of the value of a function on subgroups defined by its second argument, which can be a factor or a list of factors. For example, in the quine data frame we can summarize Days classified by Eth and Lrn.

 * the class() function shows the class of an object; use it in combination with lapply() to get the classes of the columns of the quine data frame
 * do the same with sapply(): what is the difference?
 * try to combine this with what you learned about indexing and create a new data frame quine2 containing only the columns which are factors
 * calculate the row and column means of the matrix m defined below using the apply() function (PS: in real-life applications use the rowMeans() and colMeans() functions instead)
 * use tapply() to summarise the number of missing days at school per Ethnicity and/or per Sex (three lines)
 * sometimes the aggregate() function is more convenient; note the use of ~, which is read as "is dependent on" and is used extensively in modelling
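The summary of Days by Eth and Lrn mentioned in the tapply() description might look like the sketch below. It assumes that the quine data frame is the one shipped with the MASS package and that the mean is the summary of interest; the aggregate() call shows the formula interface with ~ for the same grouping.

```r
# Assumption: quine comes from the MASS package
data("quine", package = "MASS")

# Mean number of days absent for each combination of Eth and Lrn
days_by_group <- tapply(quine$Days, list(quine$Eth, quine$Lrn), mean)
days_by_group

# The same summary with aggregate(): Days "is dependent on" Eth and Lrn
days_agg <- aggregate(Days ~ Eth + Lrn, data = quine, FUN = mean)
days_agg
```

tapply() returns a matrix with one row per level of Eth and one column per level of Lrn, whereas aggregate() returns a data frame with one row per combination.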

Function Exercises (Verzani)

 * Write a function to compute the average distance from the mean for some data vector.
 * Write a function f() which finds the average of the x values after squaring and subtracts the square of the average of the numbers. Verify that this output will always be non-negative by computing f(1:10).
 * An integer is even if the remainder upon dividing it by 2 is 0. This remainder is given by R with the syntax x %% 2. Use this to write a function iseven(). How would you write isodd()?
 * Write a function isprime() that checks if a number x is prime by dividing x by all values from 2, ..., x-1 and then checking to see if there is a remainder of 0.

Function Exercises (Verzani) Solutions

Write a function to compute the average distance from the mean for some data vector.
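One possible solution, offered as a sketch rather than the course's own answer. It reads "distance" as the absolute difference, and the name avg_dist() is an assumption.

```r
# Average absolute distance of the values in x from their mean
avg_dist <- function(x) {
  mean(abs(x - mean(x)))
}

avg_dist(1:10)
```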

Function Exercises (Verzani) Solutions

Write a function f() which finds the average of the x values after squaring and subtracts the square of the average of the numbers. Verify that this output will always be non-negative by computing f(1:10).
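A possible solution, given as a sketch rather than the course's own answer. The function computes the mean of the squares minus the square of the mean, which is the variance with divisor n and therefore cannot be negative.

```r
# Mean of the squared values minus the square of the mean
f <- function(x) {
  mean(x^2) - mean(x)^2
}

f(1:10)  # 38.5 - 30.25 = 8.25
```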

Function Exercises (Verzani) Solutions

An integer is even if the remainder upon dividing it by 2 is 0. This remainder is given by R with the syntax x %% 2. Use this to write a function iseven(). How would you write isodd()?
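A possible solution, given as a sketch rather than the course's own answer. Defining isodd() as the negation of iseven() is one design choice; testing x %% 2 == 1 directly would work as well.

```r
# TRUE where the remainder after dividing by 2 is 0
iseven <- function(x) {
  x %% 2 == 0
}

# An integer is odd exactly when it is not even
isodd <- function(x) {
  !iseven(x)
}

iseven(1:6)
isodd(1:6)
```

Because %% is vectorised, both functions accept a whole vector and return one logical value per element.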

Function Exercises (Verzani) Solutions

Write a function isprime() that checks if a number x is prime by dividing x by all values from 2, ..., x-1 and then checking to see if there is a remainder of 0.
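A possible solution, given as a sketch rather than the course's own answer. It assumes x is a single whole number; the guard for values below 2 and the use of seq_len() to handle x = 2 are additions not asked for in the exercise text.

```r
# Trial division by every value from 2 to x - 1
isprime <- function(x) {
  if (x < 2) {
    return(FALSE)                  # 0, 1 and negative numbers are not prime
  }
  divisors <- seq_len(x - 1)[-1]   # 2, ..., x - 1 (empty when x is 2)
  all(x %% divisors != 0)          # prime if no divisor leaves remainder 0
}

isprime(7)
isprime(9)
```

Writing the divisors as 2:(x - 1) would count downwards for x = 2 and give the wrong answer there, which is why the sketch drops the first element of seq_len(x - 1) instead.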