### Three faces of FEM

#### by Yi Zhang

The Finite Element Method (FEM) has acquired several interpretations over decades of development. Assume we are solving a PDE of the form

$$L(u) = 0,$$

where $L$ is a partial differential operator and $u$ is the unknown function. FEM can be interpreted in the following ways (let me know if you have others):

- Rayleigh-Ritz
- Galerkin method
- Method of Weighted Residuals (MWR)

The first one, the Rayleigh-Ritz method, is actually what inspired structural engineers to devise FEM. The connection between this method and FEM is the two-way street of the variational formulation of a PDE: the PDE is the Euler equation of a certain functional $J$ over $u$. The derivation is done by searching for the minimum of the functional (usually some form of energy). The Rayleigh-Ritz method addresses the PDE problem by assuming the form of $u$ as

$$u \approx \hat{u} = \sum_{i=1}^{n} c_i \phi_i,$$

then substituting this expansion into the PDE's functional formulation, in which the $c_i$'s are obtained by solving a minimization problem. The original Rayleigh-Ritz method adopts functions with global support as $\phi_i$. A textbook example is the boundary value problem for the modes of a beam, in which we can take the $\phi_i$ to be sinusoidal functions. The $\phi_i$ are called **trial functions**.
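
As a minimal sketch of the Rayleigh-Ritz procedure, assume the model problem $-u'' = f$ on $(0,1)$ with $u(0) = u(1) = 0$, whose energy functional is $J(u) = \int_0^1 \left(\tfrac{1}{2} u'^2 - f u\right) dx$. The model problem, the function names and the quadrature settings below are illustrative assumptions, not something taken from the discussion above. With globally supported sine trial functions, setting $\partial J / \partial c_i = 0$ yields a small linear system for the coefficients:

```python
import numpy as np

def rayleigh_ritz_sine(f, n_terms=5, n_quad=64):
    """Minimize J(u) = int_0^1 (0.5*u'^2 - f*u) dx over u = sum_i c_i sin(i*pi*x).

    Assumed model problem: -u'' = f on (0, 1), u(0) = u(1) = 0.
    """
    # Gauss-Legendre nodes and weights, mapped from [-1, 1] to [0, 1]
    t, w = np.polynomial.legendre.leggauss(n_quad)
    x, w = 0.5 * (t + 1.0), 0.5 * w

    i = np.arange(1, n_terms + 1)[:, None]      # mode numbers as a column
    phi = np.sin(i * np.pi * x)                 # trial functions at quadrature nodes
    dphi = i * np.pi * np.cos(i * np.pi * x)    # their derivatives

    # dJ/dc = 0  <=>  K c = b, with K_ij = int phi_i' phi_j', b_i = int f phi_i
    K = (dphi * w) @ dphi.T
    b = (phi * w) @ f(x)
    return np.linalg.solve(K, b)

def evaluate(c, x):
    """Evaluate the trial expansion sum_i c_i sin(i*pi*x) at the points x."""
    i = np.arange(1, len(c) + 1)[:, None]
    return c @ np.sin(i * np.pi * np.atleast_1d(x))

# Usage: constant load f = 1, for which the exact solution is x*(1 - x)/2
c = rayleigh_ritz_sine(lambda x: np.ones_like(x), n_terms=9)
u_mid = evaluate(c, 0.5)[0]
```

Because the sine functions are orthogonal, `K` is diagonal here and each coefficient could be computed independently; the general solve is kept so the same sketch works for other global trial functions.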

To obtain a PDE's variational formulation in a general way, and at the same time relax some of the regularity requirements on the solution, the Galerkin method is used. The so-called **test functions** $v$ are chosen with the desired regularity (usually infinitely differentiable). In the Galerkin procedure, test functions are applied to $L(u) = 0$ and an integral formulation is obtained. If $L$ is linear (and self-adjoint), this integral formulation is the same as the functional formulation of the minimization problem in the Rayleigh-Ritz method. Here we have introduced two sets of functions: trial functions, which define the finite-dimensional approximation of $u$, and test functions, which define how closely (accurately) the PDE is solved. The name “test” can be understood in the following way. How close a variable $a$ is to zero can be measured by *testing* it against other variables:

$$(a, b) = 0,$$

where $b$ is a known variable used for testing, and $(\cdot, \cdot)$ is a suitable inner product. Conceptually, the larger the set of $b$ for which this holds, the closer $a$ is to zero. And if $a$ satisfies the above equation for every $b$ in the space, it must be zero itself by definition.
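
The idea of "testing" can be illustrated in a finite-dimensional setting. The sketch below is a hypothetical analogy in $\mathbb{R}^n$ with the Euclidean inner product, using made-up test vectors: each additional independent test vector removes one degree of freedom from the set of vectors that pass all tests, until only the zero vector remains.

```python
import numpy as np

def free_dimensions(n=5):
    """Dimension of {a in R^n : (a, b_k) = 0 for the first k test vectors}, for k = 0..n."""
    # Hypothetical, linearly independent test vectors b_1..b_n (rows):
    # upper triangular matrix with ones on and above the diagonal
    tests = np.eye(n) + np.triu(np.ones((n, n)), 1)
    dims = []
    for k in range(n + 1):
        # Rank of the first k test vectors = number of independent constraints on a
        rank = int(np.linalg.matrix_rank(tests[:k])) if k > 0 else 0
        dims.append(n - rank)
    return dims

dims = free_dimensions()
```

When the test vectors span the whole space, the remaining dimension is zero, which is the discrete counterpart of "tested against everything, hence zero".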

So we can see that the space $V$ of test functions defines the extent to which, and in turn the accuracy with which, the PDE holds, by requiring

$$(L(u), v) = 0 \quad \text{for all } v \in V.$$

A subcategory of the Galerkin method is the Bubnov-Galerkin method, in which the test functions and trial functions are *the same*. Otherwise, it is called the Petrov-Galerkin method. Historically, a trial function is also commonly referred to as a **shape function**.
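
To make the Galerkin procedure concrete, take as an assumed model problem $L(u) = -u'' - f = 0$ on $(0,1)$ with $u(0) = u(1) = 0$. Testing with a function $v$ that vanishes on the boundary and integrating by parts once gives

$$\int_0^1 (-u'' - f)\, v \, dx = \int_0^1 u' v' \, dx - \int_0^1 f v \, dx = 0,$$

so the tested form needs only one derivative of $u$ instead of two. The sketch below is a Bubnov-Galerkin discretization of this weak form with piecewise-linear "hat" shape functions used as both trial and test functions. The mesh, names and quadrature rule are illustrative choices, not a definitive implementation.

```python
import numpy as np

def galerkin_p1(f, n_elements=8):
    """Bubnov-Galerkin FEM for -u'' = f on (0, 1), u(0) = u(1) = 0, linear elements."""
    nodes = np.linspace(0.0, 1.0, n_elements + 1)
    h = nodes[1] - nodes[0]
    n = n_elements + 1
    K = np.zeros((n, n))
    b = np.zeros(n)

    # Element stiffness: int over one element of phi_i' * phi_j'
    k_local = np.array([[1.0, -1.0], [-1.0, 1.0]]) / h
    # 2-point Gauss rule on the reference element [-1, 1] (both weights equal 1)
    gauss_points = np.array([-1.0, 1.0]) / np.sqrt(3.0)

    for e in range(n_elements):
        idx = [e, e + 1]
        K[np.ix_(idx, idx)] += k_local
        for g in gauss_points:
            xg = nodes[e] + 0.5 * h * (g + 1.0)                       # physical point
            shape = np.array([0.5 * (1.0 - g), 0.5 * (1.0 + g)])      # hat functions at g
            b[idx] += 0.5 * h * f(xg) * shape                         # int f * phi_i

    # Impose u = 0 at both ends by solving only for the interior nodes
    interior = np.arange(1, n - 1)
    u = np.zeros(n)
    u[interior] = np.linalg.solve(K[np.ix_(interior, interior)], b[interior])
    return nodes, u

# Usage: constant load f = 1, exact solution x*(1 - x)/2
nodes, u = galerkin_p1(lambda x: 1.0, n_elements=8)
```

Compared with globally supported sine functions, the hat functions have local support, so `K` is tridiagonal; a dense matrix is used here only to keep the sketch short.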

Another abstraction of FEM is through MWR. Since all numerical methods for solving PDEs rely on moving from infinite-dimensional solution spaces to finite-dimensional ones, errors are introduced in every method. So instead of having $L(u)$ exactly equal to zero, $u$'s numerical approximation $\hat{u}$ introduces a residual $R$:

$$L(\hat{u}) = R.$$

If the numerical solution takes the form of an expansion in trial functions, the coefficients (coordinates) can be obtained by constraining $R$. In light of the above philosophy, $R$ can be required to be zero *in some sense*, specifically,

$$\int_{\Omega} w_j R \, d\Omega = 0$$

for some functions $w_j$ and domain $\Omega$. This can be viewed as putting weight on locations defined by $w_j$: the greater $w_j$ is at a certain location, the more strongly $R$ is constrained there. For this reason the $w_j$'s are also called **weight functions**.

Though mathematically the Galerkin method and MWR look the same, they actually have different emphases. In the former, we are specifically looking for an integral form, with shifting regularity onto the test functions in mind, while in MWR we focus on minimizing the residual, and the resulting integral form is just the mathematical “trick” played with certain weight functions. In fact, by using different weight functions in MWR, we can recover other numerical discretizations. Besides the Bubnov/Petrov-Galerkin methods, $w_j = \delta(x - x_j)$ gives the collocation method, $w_j = 1$ within a certain cell and $0$ otherwise gives the finite volume method (FVM), and $w_j = \partial R / \partial c_j$, where the $c_j$ are the expansion coefficients, gives the least squares method.
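
The effect of the weight function can be seen on a small example. The sketch below assumes the model problem $L(u) = u'' + u + x = 0$ on $(0,1)$ with $u(0) = u(1) = 0$ and a single trial function $\phi = x(1-x)$, so that $\hat{u} = c\,\phi$ and the residual $R = c\,(\phi'' + \phi) + x$ is linear in $c$. The problem, the one-term expansion and the choice of a single cell covering the whole domain for the FVM-type weight are illustrative assumptions.

```python
import numpy as np

def mwr_one_term():
    """Coefficient c of u_hat = c*x*(1-x) for u'' + u + x = 0 under different weights."""
    # Gauss-Legendre rule mapped to [0, 1]; exact for the polynomials used here
    t, q = np.polynomial.legendre.leggauss(8)
    x, q = 0.5 * (t + 1.0), 0.5 * q

    phi = lambda s: s * (1.0 - s)      # trial function
    a = lambda s: -2.0 + phi(s)        # phi'' + phi, the part of R multiplying c
    g = lambda s: s                    # the part of R independent of c

    def solve(w):
        # int w * (c*a + g) dx = 0  =>  c = -int(w*g) / int(w*a)
        return -np.sum(q * w(x) * g(x)) / np.sum(q * w(x) * a(x))

    return {
        "collocation": -g(0.5) / a(0.5),                 # w = delta(x - 0.5): R(0.5) = 0
        "subdomain": solve(lambda s: np.ones_like(s)),   # w = 1 on one cell = whole domain
        "galerkin": solve(phi),                          # w = phi (Bubnov-Galerkin)
        "least_squares": solve(a),                       # w = dR/dc
    }

coefficients = mwr_one_term()
```

For this setup the four choices give $c = 2/7$, $3/11$, $5/18$ and $55/202$ respectively: the same trial function yields a different approximation depending only on how the residual is weighted.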