Contents lists available at ScienceDirect

Computers & Industrial Engineering 79 (2015) 27–41

Computers & Industrial Engineering

j o u r n a l h o m e p a g e : w w w . e l s e v i e r . c o m / l o c a t e / c a i e

A comparative study on modelling material removal rate by ANFIS and polynomial methods in electrical discharge machining process

Khalid Al-Ghamdi, Osman Taylan

a r t i c l e

i n f o

a b s t r a c t

King Abdulaziz University, Faculty of Engineering, Department of Industrial Engineering, P.O. Box 80204, Jeddah 21589, Saudi Arabia

Article history: Received 27 April 2013 Received in revised form 26 October 2014 Accepted 29 October 2014 Available online 7 November 2014

Due to the controversy associated with modelling Electrical Discharge Machining (EDM) processes based on physical laws; this task is predominantly accomplished using empirical modelling methods. The modelling studies reported in the literature deal predominantly with quantitative parameters i.e. ones with numerical levels. In fact, modelling categorical parameters has been devoted a scant attention. This study reports the results of an EDM experiment conducted on the Ti–6Al–4V alloy. Its aim was to model the relationship between the Material Removal Rate (MRR) and the parameters of the process, namely, current, pulse on-time and pulse off-time along with a categorical factor (electrode material). The mod- elling process was accomplished using adaptive neuro-fuzzy inference system (ANFIS) and polynomial modelling approaches. In fact, one purpose of this study was to compare the performance of these mod- elling approaches as no study was found contrasting their prediction capability in the literature. Regard- ing the polynomial model, two numerical parameters (current and pulse on-time) were declared significant in the ANOVA together with the electrode material and its interaction with pulse on-time. Thus, they were all incorporated in the developed polynomial model. Furthermore, five ANFIS models with 6, 9, 19, 21 and 51 rules were developed utilizing the first order Sugeno fuzzy approach by back- propagation neural networks training algorithm. Of these, the ANFIS model with 21 rules was the best. This model also outperformed the polynomial model remarkably in terms of predicting error, residuals range and the correlation coefficient between the experimental and predicted MRR values. The study sheds light on the powerful learning capability of ANFIS models and its superiority over the conventional polynomial models in terms of modelling complex non-linear machining processes.

(cid:2) 2014 Elsevier Ltd. All rights reserved.

1. Introduction

Recent years have witnessed remarkable increase in the demand for the titanium based alloys owing to their high strength-weight ratio, excellent fracture and corrosion resistance and superior mechanical properties at elevated temperature. An extensive utilization of these alloys has been observed in the aerospace industry (Armendia, Garay, Iriarte, & Arrazola, 2010; Moiseev, 2005). Indeed, many of the aero-engine components are produced using these alloys due, in no small part, to their relatively lightweight which helps to reduce the aircraft’s overall weight thereby moderating fuel consumption (Singh & Khamba, 2007). Moreover, titanium alloys are noticeably replacing the traditionally used materials in aerospace industry such as aluminium (Lütjering & Williams, 2007). They are also used increasingly in other indus-

⇑ Corresponding author. Tel.: +966 500031056; fax: +966 26952486.

Keywords: EDM MRR Polynomial model Neuro-fuzzy model Non-conventional machining

trial and commercial applications such as, chemical processing, pollution control, surgical implantation, nuclear waste storage, food processing and petroleum refining. A major hindrance for the wider use of titanium alloys is the difficulties associated with machining them conventionally. Being chemically reactive to almost all tool materials, they tend to weld the cutting tools while machining thereby accelerating tool failure (Hartung, Kramer, & Von Turkovich, 1982). The damage to cutting tools is compounded by the fact that the heat generated at the tool-work-piece interface while machining is concentrated at the tool’s cutting edges. This heat cannot be transferred through the work-piece due to the poor thermal conductivity of titanium alloys. Moreover, a very high mechanical stress is accumulated on the immediate vicinity of the cutting edge as a result of high strength and hardness that these alloys retain at elevated temperature poses a further imped- iment to their machine-ability (Che-Haron & Jawaid, 2005; Trent & Wright, 2000). Ezugwu and Wang (1997) asserted that it was due to the limitations associated with machining these alloys that large companies such as Rolls-Royce and General Electric invested large

E-mail addresses: kaaalghamdi@kau.edu.sa (K. Al-Ghamdi), otaylan@kau.edu.sa (O. Taylan).

http://dx.doi.org/10.1016/j.cie.2014.10.023 0360-8352/(cid:2) 2014 Elsevier Ltd. All rights reserved.

formed the examined ANN techniques in predicting the perfor- mance measures whereas comparable performance of the two approaches was reported by (Pradhan & Biswas, 2010).

sums of money in developing techniques to deal with them. Ezugwu, Bonney, Da Silva, and Cakir (2007) argued that, despite these limitations, machining will remain the favoured option for manufacturing complex titanium alloys at a competitive cost in the foreseeable future.

Some gaps were noted in the current literature on modelling die-sinking EDM process. First, while some of the studies com- pared the performance of ANN with that of ANFIS, none of them contrasted the prediction capability of polynomial models with that of ANFIS. This paper attempts to fill this void by comparing the performance of a polynomial model with a Neuro-Fuzzy model derived based on different number of rules and fuzzy inference systems. Second, the parameters examined in pub- lished modelling studies were, by and large, quantitative i.e. their levels were numerical. In fact, scant attention has been accorded to the case of correlating qualitative (categorical) parameters with the key performance measures of EDM, partic- ularly when polynomial modelling was adopted. In this study, the relationship between three numerical parameters along with a categorical one and MRR was modelled using a polynomial function as well as a Neuro-Fuzzy system. The implications of incorporating a qualitative factor and the way in which it was dealt with are elaborated on. Third, in the context of machining Ti–6Al–4V, Caydas and Hascalik (2008) appears to be the only study that addressed the modelling of the EDM process of this alloy. The present work extends that of Caydas and Hascalik (2008) in two aspects. First, while Caydas and Hascalik (2008) examined the influence of three parameters; namely, pulse on- time, pulse off-time and current on the electrode wear and recast thickness, the current research assessed the impact of the same three parameters (at different levels) along with the effect of electrode material on the MRR, which was not examined in the above mentioned study. Second, although the response sur- face method was the only modelling approach adopted in Caydas and Hascalik (2008), this paper utilizes and compares the perfor- mance of two methods; namely, polynomial modelling and ANFIS, bearing in mind that, to date, the latter has not been applied to model the die-sinking EDM of Ti–6Al–4V.

Non-conventional machining processes provide effective alter- natives to the conventional ones in dealing with the technical dif- ficulties associated with machining titanium alloys. Of these, Electrical Discharge Machining (EDM) is the most widely known and used process for the manufacture of engineering components (Aspinwall, Soo, Berrisford, & Walder, 2008). In this process, the material removal takes place as a result of the discharge of energy between a tool and a work-piece, which are separated by a small gap filled with a dielectric fluid. The process involves a succession of discrete discharge pulses initiated using a DC pulse generator for a certain duration followed by a similar period during which deion- isation of the dielectric occurs and the gap is flushed of debris (McGeough, 1988). A very high temperature is generated by the discharge energy at the point of the spark on the work-piece and the material is removed by melting and vaporization. The stability of the process is maintained by a servo-controlled mechanism whereby a constant gap between the tool and the work-piece is sustained whilst impressing the former pre-selected shape (Singh, Maheshwari, & Pandey, 2004). The EDM theory was postu- lated by two Soviet scientists in the middle of 1940s (Ho & Newman, 2003). Its process is very effective in machining materi- als of any hardness given that they conduct electricity (Abbas, Solomon, & Bahari, 2007; Yan, Chung, & Yuan Huang, 2005). A major advantage of the EDM process is its electro-thermal nature which eliminates the need for any direct contact between the elec- trode and work-piece during machining thereby eradicating the mechanical stress, chatter, tool deformation and vibration prob- lems (Tsai & Masuzawa, 2004). A recent review of titanium alloys EDM is reported in (Kumar, Singh, Batish, & Singh, 2012). The scope of the current work is centred on the EDM of the Ti–6Al–4V alloy, which accounts for 50–60% of total titanium alloy production (Hood, Lechner, Aspinwall, & Voice, 2007).

The methods adopted in the current study are firstly described. A discussion of the experimental design and procedures follows. The obtained results are then presented and discussed. After com- paring the performance of the investigated modelling approaches, the paper culminates with the main conclusions.

2. Methodology

Material Removal Rate (MRR) is one of the key performance measures in EDM. The ability to rigorously understand its mecha- nism and accurately predict its values based on the well-known physical laws are still debatable (Markopoulos, Manolakos, & Vaxevanidis, 2008; Tsai & Wang, 2001a). In fact, the large number of parameters involved and their interdependence together with the stochastic mechanism of the EDM process appear to signifi- cantly impair the ability to establish a scientifically admissible theory correlating the process variables with the main perfor- mance measures such as the MRR. Consequently, this task is predominantly accomplished by means of empirical or semi- empirical modelling methods. Broadly, two approaches are adopted; namely, the use of conventional statistical models and the application of soft computing algorithm such as ANFIS. For instance, the second order (quadratic) polynomials, conventionally employed in response surface method, were used to relate the EDM process parameters with its key performance measures in the case of machining cobalt-bonded tungsten carbide (Puertas, Luis, & Alvarez, 2004), silicon carbide (Luis, Puertas, & Villa, 2005), Al–4Cu–6Si alloy–10 wt.% SiCP composites (Dhar, Purohit, Saini, Sharma, & Kumar, 2007) and DIN 1.2714 steel (Zarepour, Tehrani, Karimi, & Amini, 2007). Artificial neural network (ANN) techniques, on the other hand, were used to model and predict the main EDM performance characteristics in the case of machin- ing aluminium and iron (Tsai & Wang, 2001b) and various steel grades (Panda & Bhoi, 2005; Pradhan & Biswas, 2010). The perfor- mance of ANN was compared with that of the ANFIS in two studies. In one of these (Puertas et al., 2004), the ANFIS method outper-

Design of Experiments (DoE) is an effective and efficient approach for exploring, understanding and empirically modelling the cause-and-effect relationship between the engineering pro- cesses’ parameters and their performance measures. An experi- mental investigation involves purposeful changes of the input variables of a process or system to explore and justify the changes that may occur in the output response (Montgomery, 2010). Generally a DOE study is performed in three stages: plan ning, conducting and analysis and interpretation (Antony, 2003). The planning stage involves recognizing the problem or the improve- ment opportunity, stating the objectives, selecting the perfor- mance measure(s) and the measurement system(s), determining the factors that may influence the chosen performance measure(s), choosing the levels for the factors, finding an appropriate design to vary the factor levels in accordance with and assigning the factors and interactions to the selected design. Having accomplished these tasks, the next stage is to carry out the experiment as planned. Finally, the obtained results should be analyzed and interpreted. In so doing, two approaches were adopted in this study; namely, the polynomial modelling approach and the ANFIS.

28 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

2.1. Polynomial modelling approach

(Puertas et al., 2004; Luis et al., 2005; Dhar et al., 2007; Zarepour et al., 2007). For ‘m’ factors, a model of this order comprises (m + 1)(m + 2)/2 terms and takes the form that is given in Eq. (2):

m X

m X

m X

ð2Þ

^y ¼ ^b0 þ

^bixi þ

^biix2

^bijxixj

i þ

i

i¼1

i¼1

Polynomial modelling is a major phase of the statistical analysis of experimental data which comprises four stages: estimation, hypothesis testing, modelling and identifying the best parameters setting. The estimation involves obtaining a number to quantify the impact of each of the studied parameters and their interactions on the investigated performance measure(s). To accomplish this task in the current study, a measure that was initially proposed by Kelley (1935) and labelled the Percentage Contribution (PC) by Ross (1996) was adopted. This was obtained using Eq. (1).

PC ¼

(cid:3) 100

ð1Þ

SSP (cid:2) ðdfpÞMSE SStotal

where ^b0 is the intercept, ^bi is the coefficient of the linear effect of factor i, ^bii is the coefficient of the quadratic effect of factor i and ^bij is the coefficient of the interaction effect between factors i and j. Having developed an appropriate empirical model for the process under study, the next step is to identify the best combination of the parameters values. This task can be accomplished using such graphical tools as main effects and interactions plots. Alternatively, formal optimization techniques such as linear and non-linear pro- gramming (Taha, 2010) may be employed using the developed empirical model as an objective function to be maximized or mini- mized as appropriate.

2.2. Adaptive neuro-fuzzy inference system (ANFIS) for EDM processes

Fuzzy systems and ANNs are two of the major soft computing approaches. The relationship between neural networks and linguis- tic knowledge based system is bidirectional and broadly discussed in (Ishibuchi, Nii, & Turksen, 1998). Neural network-based systems can be trained by numerical data and fuzzy rules can be extracted. On the other hand, fuzzy rule-based systems can be designed and developed by linguistic knowledge. Normally, the fuzzy logic and neural systems have very contrasting applications. For instance, fuzzy systems are appropriate if sufficient expert knowledge about the process is available, while neural systems are useful if sufficient measurable data of process are available (Taylan & Karagözog˘lu, 2009). Both approaches build nonlinear systems based on bounded continuous variables. However, they are different in the sense that neural systems are treated in a numerical quantitative manner, whereas fuzzy systems are treated in a symbolic qualitative man- ner. In this study, a class of adaptive neuro-fuzzy networks with the ultimate aim of designing a Fuzzy Inference System (FIS) via learning is proposed. The network structure and learning algorithm imply a systematic approach for the FIS design.

where SSp is the parameter’s Sum of Squares, dfp is its degrees of freedom and the MSE is the mean square error of the experiment. The PC gives the percentage each parameter contributes to the total performance measure’s variation, and is a measure of how much the performance could be improved if the parameter was controlled accurately. The quantities required for estimating the PC can be obtained from the Analysis of Variance (ANOVA) which is the main technique for accomplishing the hypothesis testing task (the process through which the statistical significance of each of the examined parameter and interaction is tested). The ANOVA involves decomposing the observed total variance into compo- nents due to different sources of variation (factors, interactions and experimental error). Dividing the estimated variance for each parameter by that of error lends a random variable that follows an F-distribution and can be used to test the statistical significance of the concerned parameter. However, when the experiment is not replicated, there will be no degrees of freedom to estimate the error variance. In such cases, it is necessary to identify the factors or interactions that can reasonably be pooled to provide an esti- mate of the error variance. A powerful tool for accomplishing this task is the Half Normal probability Plot (HNP) (Daniel, 1959) of which the use is conventionally confined to analyzing unreplicated two-level experiments. Essentially, HNP can be constructed for any design provided that the plotted effect estimates are uncorrelated and have the same variance. For three-level designs, Wu and Hamada (2011) showed how HNP can be used in the case of decomposing the studied factors into single degree of freedom components with equal variance. Whitcomb and Oehlert (2007) proposed an alternative strategy for using the HNP that relaxed the condition of splitting the effects into single degree of freedom elements. In this strategy, the sums of squares are used as alterna- tive measures of the factorial effects which are commonly esti- mated using the contrasts among the averages of response values associated with the levels of the studied factors. Furthermore, an iterative method that bears considerable resemblance with the stepwise variable selection procedure is adopted to estimate a pro- visional error variance. Dividing the sum of squares of a selected effect by this estimate renders a chi-square statistic for which a provisional p-value can be computed and translated in terms of a provisional z-score of a Half Normal distribution.

The factors and interactions that pronounced significant in the ANOVA are used to develop an empirical model correlating them with the response under study (Wu & Hamada, 2011).

An adaptive network is a neuro-fuzzy system with an overall input–output behaviour that is determined by the collection of modifiable parameters. The structure of an adaptive network is composed of nodes connected by directed links, where each node performs a function on its incoming signals to generate a signal to the output node and each link specifies the direction of signal flow from one node to another (Jang, Sun, & Mizutani, 1997). The network structure encodes fuzzy ‘If-then’ rules’ in which the conse- quent outcome is a function of the input variables. Fuzzy sets are considered to be advantageous in the logical field, and in handling higher order operations easily. The basic advantage of the ANFIS is the addressing of fuzzy linguistic parameters and their term sets that must be identified by the designer. In general, the designer chooses the types of membership functions (MFs). The number of membership functions for each input space might be a drawback for an ANFIS design besides the parametric learning and structure learning problems dealing with the partition on the input–output universes (Taylan & Darrab, 2011). In order to overcome the draw- back of ANFIS for the rule generation, the fuzzy sub-clustering approach is generally used. In this method, rules are generated based on the clustering of input and output data set. The number of rules is usually equal to the number of output clusters regardless of the number of input variables. Each data point is regarded as a potential cluster centre and a measure of likelihood defining the cluster centre based on the density of surrounding data points is calculated. In order to generate an ANFIS structure, a cluster radius

Polynomials are very powerful for empirically modelling the cause-and-effect relationship between the process parameters (X1, X2..Xm) and the performance measure of interest (Yi). Mathe- matically, this relationship is developed using Taylor series expan- sion of terms involving the studied factors and interactions. Generally, a polynomial model is said to be of order ‘N’ if this is the power of one or more of its factor terms or if ‘N’ is the sum of the powers of the factors involved in one of its interaction terms. Polynomials of second order are commonly used in modelling EDM

29 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

error term ei was defined, as the derivative of the error measure Ep with respect to the output of node i and it is given in Eq. (6).

ð6Þ

ei ¼

@þEp @xi

Then the weight update wki for on-line learning is calculated by

Eq. (7).

ð7Þ

¼ (cid:2)g

Dwki ¼ (cid:2)g

¼ (cid:2)geixk

@þEp @xi

@þEp @wki

@xi @wki

must be specified to indicate the range of influence of the cluster. In this approach, specifying a small cluster radius will yield many small clusters in the data set, and will produce too many rules. Hence, the number of MFs will be high and the optimality of outcomes will be lower. To improve the performance of a fuzzy system, the parameters in terms of the structure, complexity, type of networks, etc., have to be determined and tuned (Taylan & Darrab, 2011). On the other hand, an ANFIS model is able to pro- duce crisp numerical outcomes. It includes the following steps: (1) defining input and output variables by linguistic statements, (2) deciding on the fuzzy partition of the input and output spaces, (3) choosing the MFs for the input and output linguistic variables, (4) deciding on the fuzzy control rules, (5) designing the inference mechanism and (6) choosing a de-fuzzification procedure.

where g is the learning rate that affects the convergence speed and stability of the weights during learning. For off-line learning, the connection weight wki is updated only after the presentation of entire data set or after the specified epochs are completed as given in Eq. (8).

2.2.1. ANFIS model development by BP MLP

ð8Þ

Dwij ¼ (cid:2)g

; Dwki ¼ (cid:2)g

@þEp @wij

@þEp @wki

This corresponds to a way of using the true gradient direction

based on the entire data set.

The aim of this study is to model the relationship between the MRR and the input parameters of the examined EDM process, namely, current, pulse on-time, pulse off-time and electrode mate- rial. As the modelling task will be accomplished using adaptive neuro-fuzzy inference system (ANFIS) and polynomial modelling, another purpose of the study is to compare the performance of these modelling approaches taking in consideration the presence of a qualitative factor; namely, the electrode material.

3. Experimental design and procedures

The Back-Propagation (BP) Multilayer Perceptrons (MLP) is an adaptive network whose nodes perform the same function on incoming signals; this node function is usually a composite of weighted sum and it is a differentiable nonlinear activation func- tion. The most commonly used activation functions in BP MLPs are logistic, hyperbolic tangent, and identity functions. The first two functions provide smooth, nonzero derivatives with respect to input signals. Sometimes they are referred to as squashing func- tions since the inputs to these functions are squashed to the range [0,1] or [(cid:2)1,1]. On the other hand, they are also called sigmoidal functions because of their S-shaped curves that exhibit smoothness and asymptotic properties. For an ANN, the node function for the output layer is considered to be a weighted sum of squashing functions. This is equivalent to a situation in which the activation function is an identity function, and output nodes of this type are often called linear nodes. The net input x of a node is defined as the weighted sum of the incoming signals plus a bias term. For instance, the net input and output of node j are calculated by Eq. (3). and Eq. (4), respectively.

36 X

ð3Þ

Net xj ¼

wijxi þ wj

i¼1

ð4Þ

xj ¼ f ðxjÞ ¼

1 1 þ expð(cid:2)xjÞ

In performing the experiment, an ONA D-2030-S Die sinking EDM machine with programmable pulse current and on/off time was used. Kerosene was used as dielectric and was supplied to the electrode/work-piece interface using two adjustable nozzles at a flow rate of 2 litres/minute. As already pointed out, the work-piece material was the Ti–6Al–4V alloy of which detailed properties are presented in (Ezugwu & Wang, 1997). This was sup- plied in bars which were cut into 15 mm long cylindrical discs each with a diameter of 20 mm. Regarding the electrode, each material type was also provided in bars and was cut into cylindrical discs each with 50 mm length of 12 mm diameter. For each experiment, new electrode was used so that the wear associated with the pre- ceding run exerts no effect on the following ones. This also helped ensure that all the experiments start at the same electrode conditions.

where xi is the output of node i located in any one of the previous layers, wij is the weight associated with the link connecting nodes i and j, and wj is the bias of node j. Since the weights wij are actually internal parameters associated with each node j, changing the weights of a node will alter the behaviour of the node and in turn alter the behaviour of whole BP algorithm. In this study, a BP algorithm forming the steepest descent method for MF estimation was employed.

The training error is the difference between the training data output value, and the output of the fuzzy inference system corresponding to the input training data value. The training error records the mean squared error (MSE) of the training data set at each epoch. The backward error propagation is calculated by the following steps; first, a squared error measure for the pth input– output pair is defined by Eq. (5).

X

ð5Þ

Ep ¼

ðdk (cid:2) xkÞ2

k

For the EDM process of Ti–6Al–4V, Caydas and Hascalik (2008) conducted an experimental study aiming to develop an empirical model to correlate pulse on-time, pulse off-time and current with electrode wear and recast thickness. A further parameter; namely, electrode material was examined in this study and its influence together with that of the aforementioned three parameters on the MRR was investigated. Three levels were chosen for each of the studied parameters as detailed in Table 1. A full factorial three-level experiment comprising 34 = 81 runs was conducted. Consequently, there were 80 degrees of freedom to estimate all the investigated factorial effects. Based on the effect of sparsity principle, it was deemed very unlikely that all of these 80 effect components are significant i.e. the process is likely to be driven primarily by only some of the parameters’ main effects and interactions. On this premise, the variability associated with the insignificant effects was attributed to the experimental error and hence was utilized to estimate its variance. By so doing, it was pos- sible to analyze the experimental data without replicating its runs which would have entailed performing a prohibitive number of

where dk is the desired output and xk is the actual output for the node k when the input part of the pth data pair was presented. To minimize the error measure, firstly the gradient vector was obtained. In order to calculate the gradient vector, a form of deriv- ative beginning from the output layer was identified which is going backward layer by layer until the input layer is reached. Hence, an

30 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

31 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

Table 1 Machining parameters of the Ti–6Al–4V EDM experiment. Variable parameters Levels 1 2 3

irrespective of the pulse on-time, the highest MRR was observed with the graphite electrode. Moreover, for all of the three electrode materials, the MRR increased when the on-time increased from 100 to 150 ls. However, when pulse on-time was changed from 150 to 200, MRR decreased with different rates depending on the electrode material. While a comparable decrease rates were noted with graphite and copper, the least reduction rate was observed with the aluminium electrode. In fact, with long pulse on-time per- iod (>150 ls), arcing was observed with all of the electrode mate- rials. Consequently, the discharge efficiency was compromised rendering a noticeable decrease in MRR.

A B C D Pulse current (A) Pulse off-time (ls) Pulse on-time (ls) Electrode material 5 50 100 Graphite 10 75 150 Copper 15 100 200 Aluminium Fixed parameters Level

trials (at least 162). The adoption of principal of sparsity is not uncommon in the experimental design literature (Montgomery, 2010). Its validity has been empirically demonstrated by Li, Sudarsanam, and Frey (2006) and Bergquist, Vanhatalo, and Nordenvaad (2011).

A polynomial model was used to represent the relationship between the parameters and the interaction that were declared significant in the ANOVA and the MRR. As a categorical factor, one way of dealing with the electrode material is to develop a sep- arate model consisting only of the quantitative factors for each of its levels rendering three separate models. This approach was adopted by (Sahoo, Routara, & Bandyopadhyay, 2009) in examining the impact of different work-piece materials. Helpful though in terms of avoiding explicit modelling of categorical factors, this approach becomes quite cumbersome as the number of these factors and their levels increases. In fact, explicit modelling of categorical factors has the advantage of being more practical as it yields a single predication equation. Moreover, it renders one esti- mate of the experimental error variance and more residual degrees of freedom than fitting several regression models. Therefore, this approach was adopted in this study.

In each of the performed 81 experiments, the work-piece was machined for a fixed period of time i.e. 15 min using a positive electrode polarity. The order in which the experiments were conducted was randomly determined. The main reason for select- ing a full factorial design factorial design despite its large size was the need to provide enough experimental data to train the devel- oped ANFIS model. The performance measure of interest in the current work was the work-piece MRR. This was measured by recoding the work-piece weight before and after the EDM opera- tion using a micro-level balance; the volume of the material removed was then obtained by dividing the weight differences by the Ti–6Al–4V density. The ratio of the removed material volume to the machining time rendered the MRR (mm3/min).

4. Results and discussion

The results of the experiment are given in Table 2. Their analysis using the two investigated approaches will be presented and dis- cussed separately. The prediction capabilities of the developed polynomial and ANFIS models will then be assessed and compared.

4.1. Polynomial modelling approach

The HNP of the MRR was constructed as shown in Fig. 1. Clearly, pulse current (A), its on-time (C), electrode material (D) and the interaction of pulse on-time and electrode material (CD) stand out as potentially significant. To test their statistical significance, ANOVA was used as displayed in Table 3. Evidently, all of the examined effects are significant because their p-value being smal- ler than a = 0.05.

Electrode material was the most influential parameter as it accounted for 60.86% of the MRR variability. Pulse on-time had a higher PC (19.43%) than its interaction with electrode material, which explained 9.12% of the variability. Having a PC value of 4.57%, pulse current was the least influential among the significant parameters. Collectively, the considered factorial effects explained 93.98% (60.86 + 19.43 + 9.12 + 4.57) of the MRR variability render- ing an error PC of 6.02% (100–93.98).

To model the process under study, a model matrix was con- structed from the design array given in Table 2. This is a coded matrix that has a column for each term in the model to be devel- oped. As three main effects and one interaction of three-level fac- tors were found significant, ten terms were needed. Of these four corresponded to the linear and quadratic components of the two quantitative factors: pulse on-time and current. The qualitative factor (electrode material) has two terms associated with its two degrees of freedom. Finally, the interaction between the electrode material and pulse on-time has four terms corresponding to the four possible combinations of the linear and quadratic elements of pulse on-time and the two components of the electrode mate- rial. To construct the model matrix, it was necessary to code the levels of the studied parameters so that they become dimension- less and can be used to estimate comparable model coefficients. Coding also improves the precision with which the coefficients are estimated (Montgomery, 2010). Linear main effects of the quantitative parameters were coded so that the highest and lowest levels of each factors become 1 and (cid:2)1 respectively. Any value in between can be calculated so that its difference from 1 and (cid:2)1 is proportional to the difference between the corresponding actual value and the actual highest and lowest values. The codes for the quadratic main effect can then be obtained by squaring the coded linear main effect levels. Using this system, a least squares linear main effect coefficient is interpreted as the average change in the response variable per unit change in its factor. The quadratic main effect coefficient, on the other hand, is a measure of how its factor changes when its own values are changed. The components of the electrode material factor can be coded by assigning one dummy variable for each of the three electrode types. For each variable, a value of one can be assigned to the runs where the appropriate electrode material was used while a value of zero should be writ- ten for the remaining runs. This entails adding three dummy vari- able columns of which the row-wise sum is one, rendering a column identical to the one that should be used to estimate the model intercept. Consequently, it will not be possible to estimate any of the model coefficients as the model matrix will be singular. One way out of this problem is to drop one of the three dummy variables, in which case, each of the coefficients of the remaining

Fig. 2(a) depicts the plot of the main effect of current on MRR. Evidently, the MRR increased linearly when the pulse current increased. This is due to the increase in the discharge energy that accompanied the rise of current, which caused an increase in the molten metal volume. With regards to pulse on-time and electrode material, their main effect plots should not be relied upon owing to their interdependence which resulted in subtler patterns of MRR as can be seen from Fig. 2(b). This interaction plot shows that

Dielectric Machining time (min) Dielectric flow rate (l/min) Kerosene 15 2

32 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

Table 2 Results of the performed EDM experiment. No. Current Off time On time Electrode No. Current Off time On time Electrode MRR MRR

42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 75 100 100 100 50 50 50 75 75 75 100 100 100 50 50 50 75 75 75 100 100 100 50 50 50 75 75 75 100 100 100 50 50 50 75 75 75 100 100 100 150 150 150 150 200 200 200 200 200 200 200 200 200 100 100 100 100 100 100 100 100 100 150 150 150 150 150 150 150 150 150 200 200 200 200 200 200 200 200 200 Copper Copper Copper Copper Copper Copper Copper Copper Copper Copper Copper Copper Copper Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium Aluminium 15.59 14.75 17.48 19.31 2.25 1.44 8.17 2.03 5.96 4.28 0.91 4.22 5.85 3.75 9 8.35 6.51 8.49 9.04 2.69 4.9 6.37 11.65 19.88 26.04 16.28 11.64 23.49 16.46 12.66 19.06 7.18 14.46 15.96 14.77 13.57 16.89 12.68 17.06 12.04 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 15 5 10 50 50 50 75 75 75 100 100 100 50 50 50 75 75 75 100 100 100 50 50 50 75 75 75 100 100 100 50 50 50 75 75 75 100 100 100 50 50 50 75 75 100 100 100 100 100 100 100 100 100 150 150 150 150 150 150 150 150 150 200 200 200 200 200 200 200 200 200 100 100 100 100 100 100 100 100 100 150 150 150 150 150 Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Graphite Copper Copper Copper Copper Copper Copper Copper Copper Copper Copper Copper Copper Copper Copper 17.21 24.7 23.1 16.45 25.2 31.12 21.39 23.45 30.3 33.69 39.41 36.84 33.73 36.82 38.95 33.31 35.84 38.19 24.75 24.75 30.69 18.69 18.16 27.31 24.14 26.18 27.7 10.27 11.86 17.27 12.5 12.53 16.18 11.68 15.11 11.93 13.9 17.82 19.5 16.76 16.09

Table 3 ANOVA for MRR. Source Sum of squares df Mean square F-value p-value PC (%)

26.56 112.98 353.87 26.50 <0.0001 <0.0001 <0.0001 <0.0001 4.57 19.43 60.86 9.12 179.28 762.68 2388.76 178.91 6.75 A C D CD Residual Total 358.56 1525.35 4777.52 715.66 472.52 7849.61 2 2 2 4 70 80

levels. This system was adopted in this paper for it yields polyno- mial coefficients equivalent to the effect estimates conventionally computed in ANOVA analysis in the cases where the model matrix is orthogonal. To illustrate this, let YG; YC and YA denote respec- tively the average response values associated with the levels graphite, cooper and aluminium of the electrode material and let Y00 be the overall average. For the two components of the electrode material, two main effect coefficients ^bD1 and ^bD2 will be estimated. The former is equal to YG (cid:2) Y00 whereas the latter is equal to YC (cid:2) Y00. Although it is not included in the model, the effect asso- ciated with the third level i.e. aluminium (^bD3) can be obtained from the other two coefficients. In fact, its value equal the sum of two estimated coefficients multiplied by (cid:2)1 i.e. ^bD3 ¼ (cid:2)^bD1þ ^bD2). As it can be seen from the three curves in Fig. 2(b), the MRR has a quadratic behaviour when varied across the values of pulse

two variables will estimate the net contribution or consequence of its corresponding level relative to the excluded one. Alternatively, two columns can be used to represent the two components of the electrode material factor. The first can be formed by associating 1, 0 and (cid:2)1 with the levels graphite, cooper and aluminium respec- tively while the second may be constructed by assigning 0, 1 and (cid:2)1 respectively to the graphite, cooper and aluminium electrode

Fig. 1. HNP of the MRR.

33 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

the independence assumption, the violations of the assumption of constant variance can be detected by plotting residuals against fitted values as it is given in Fig. 3 (b).

on-time. This behaviour is dependent upon the electrode material. Put differently, the curvilinear effect of pulse on-time with graph- ite electrode differs from that exhibited with cooper and alumin- ium electrodes. To account for this, the model matrix must incorporate interaction columns. These can be generated using the entry-wise multiplications of the component columns of pulse on-time and electrode material factors. The least square estimates of these interactions serve as adjusting values for the terms in the quadratic equation that represents the behaviour of the MRR over the studied range of pulse on-time for each of the examined elec- trodes. Using the coding scheme described above, the fitted model for MRR is presented by Eq. (9).

MRR ¼ 23:5 þ 2:58XA þ 0:047X2A (cid:2) 0:171XC (cid:2) 9:20X2C

ð9Þ

þ 12:8XD1 (cid:2) 6:72XD2 þ 0:7XCXD1 (cid:2) 4:51XCXD2 (cid:2) 2:93X2CXD1 þ 0:981X2CXD2

As no unusual pattern was recognized in any of the residual plots, the ANOVA results were dammed satisfactory. To assess the model coefficients sensitivity to each of the response values, Cook (1979) suggested a measure, conventionally denoted by Di, which quantify the square difference between the least squares estimates based on all the 81 response values and those obtained by deleting the ith response value. Any response value with Di > 1 is deemed influential. For the developed model, the maxi- mum Di was 0.129 indicating the robustness of its coefficients to small changes in the response values. Another measure of the model quality is the R2-adjusted which estimates the percentage of the variability explained by the model. This was found to be 93.17% implying that only 6.83% of the total variability in MRR was left unexplained. The predictive capability of the model can also be assessed using the prediction R2 which can be computed as it is given in Eq. (11).

ð11Þ

R2 prediction ¼ 1 (cid:2) ðPRESS=SStotalÞ

ANOVA was used to test the statistical significance of the terms that the above model incorporates as it is displayed in Table 4. Clearly, the terms XC, X2A, XCXD1 and X2CXD2 are not significant at a = 0.05 level and thus they should be pooled with the error esti- mate and deleted from the developed model to yield the model presented in Eq (10).

MRR ¼ 23:5 þ 2:58XA (cid:2) 9:20X2C þ 12:5XD1 (cid:2) 6:07XD2

ð10Þ

(cid:2) 4:16XCXD2 (cid:2) 2:44X2CXD1

where PRESS is the predictive residual sum of squares (Pe(i)) and e(i) = yi (cid:2) ^yðiÞ. The ^yðiÞ is the predicted value of the ith response var- iable based on a model fit to the remaining n (cid:2) 1 response values. Having an R2 prediction of 92.42%, the fitted model is expected to explain 92.42% of the variability in predicting new response values. Considering the above discussed adequacy measures, the developed model was deemed satisfactory.

4.2. ANFIS modelling approach

In order to validate the conclusions drawn from ANOVA, certain assumptions need to be satisfied. These are related to the residuals (the difference between the observed and predicted response values) which should be normally and independently distributed with a mean of zero and constant variance. The normality assump- tion can be assessed by generating a histogram of the residuals as presented in Fig. 3(c) or by plotting them onto a normal probability graph as shown in Fig. 3 (a). While plotting the residuals in time order of data generation (see Fig. 3 (d)) is useful in examining

Fig. 2. Main effect plot of pulse current (a) and interaction of pulse on-time with electrode material (b) based on ANOVA.

In this study, five first order Sugeno ANFIS models were devel- oped based on the learning capability of Back-Propagation Neural Networks (BPNNs) algorithm. The main goal of these ANFIS models is the prediction of MRR in the EDM process. Hence, the available data was divided into training and checking sets. The training data set included 51 observations which served in model building while the checking data set involved 30 observations and served for the validation of the developed model. The developed FIS is able to produce crisp numerical outcomes for the prediction of MRR (mm3/min). The FIS includes choosing the membership functions for the input variables, deciding on the types and number of fuzzy control rules, designing the inference mechanism and choosing a de-fuzzification procedure.

Table 4 ANOVA for the fitted model. Source Sum of squares df Mean square F-value p-Value

The surface of MRR is plotted in 3D graphs against pulse off time and pulse current (see Fig. 4 (a)), similarly, the MRR was plot- ted against pulse current, and pulse on time (see Fig. 4 (b)), and against pulse off time, and pulse on time (see Fig. 4 (c)).

53.11 0.24 353.87 0.01 225.73 221.06

The nonlinear relations between parameters are clearly appear- ing in these Figs. The 3D plots of parameters are very helpful for viewing the entire output surface of EDM process and the entire

XA XC XD1 X2A X2C XD2 XCXD1 XCXD2 X2CXD1 X2CXD2 358.52 1.59 3285.36 0.04 1523.77 1492.16 87.3 548.55 71.15 8.66 358.52 1.59 2388.76 0.04 1523.77 1492.16 87.3 548.55 71.15 8.66 1 1 1 1 1 1 1 1 1 1 12.933 81.27 10.54 1.28 <0.0001 0.6292 <0.0001 0.9390 <0.0001 <0.0001 0.168 <0.0001 0.001 0.261 6.75 Residual Total 472.52 7849.61 70 80

34 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

Fig. 3. Residuals plots for assessing normality (a and c), constant variance (b) and independence (d) assumptions.

the form of

span of the output set MRR based on the entire span of the input set. Fig. 4 (d) depicts two dimensional relations and plots the MRR versus pulse current. Similarly, Fig. 4 (e) and (f) plots pulse on time and pulse off time versus MRR, respectively. In most man- ufacturing problems, relationship between the response and the independent variables is unknown. The 3D and 2D plot of parameters shows that the system is highly nonlinear and suggests the development of intelligent model. The analysis of the MRR surface in relation with the studied parameters implies the existence of many local maximum and minimum responses. This is one of the process complexity indications. In fact, there are many manufacturing processes that cannot be mathematically modelled. Such processes are often satisfactorily controlled by

human beings because of their ability to interpret linguistic state- ments about the process parameters. Fuzzy systems and the other intelligent modelling techniques can provide an effective and real- istic tool to code linguistic statements with imprecise and uncer- tain information (Taylan, 2006). Soft computing techniques are very suitable for modelling the complex manufacturing processes. In these modelling approaches, the domain expert’s intuitions, feelings, and past experiences might be used to judge the magni- tude of the process dynamics and express the system’s parameters to achieve a satisfactory quality level of resulting product. For instance, fuzzy modelling represents the dynamic relations of a process by a set of fuzzy implications with input variables, state and output variables. These implications are fuzzy ‘If-then’ rules

Fig. 4. The 3D (a, b, and c) and 2D (d, e, and f) relations of input parameters versus MRR for ANFIS model.

representing the linguistic relationships of inputs and outputs parameters.

and electrode material and the output parameter of the network is the MRR. For a given input–output data pairs, the BPNN algo- rithm performs two phases of data flow. First, the input parameters pattern was propagated from the input layer to the output layer. The error signals resulting from the difference between the real process outcomes and the network outcomes (MRR) were back propagated from the output layer to the previous layers to update their weights.

The error tolerance was fixed to 0.005 for training the algo- rithm, five different ANFIS models were constituted with different number of fuzzy rules. The training errors were determined for these models and presented in Table 5. The ANFIS model developed with 21 rules was found to have the minimum percentage of error. The RMSE (Root-Mean-Square-Error) of training error and checking error was found to be 0.3021% and 0.315024% respectively. The developed ANFIS model contained a total of 117 fitting parameters, of which 72 are premise (nonlinear) parameters and 45 are conse- quent (linear) parameters.

4.2.1. Fuzzy rules and membership functions

Fuzzy rules are mathematical relationships mapping the input parameters to output which are constituted from fuzzy linguistic variables and their term sets. Fuzzy ‘If-then’ rules are known as fuzzy implications or fuzzy conditional statements that are wide- spread in our daily linguistic expressions. Fuzzy rules and fuzzy reasoning are the backbone of fuzzy inference system which are the most important modelling tools of fuzzy sets and system (Jang et al., 1997). Fig. 7 illustrates graphical representations of fine tuned membership functions for the input parameters of EDM process together with the linguistic terms set used in this study.

In this study, the number of membership functions were decided by the application of sub-clustering algorithm in which each data point belongs to a cluster at a degree specified by a mem- bership grade. The idea of clustering algorithm is to divide the out- put data into fuzzy partitions that overlap with input parameters. In this approach, the number of rules has an essential effect on how fine a control level can be achieved on the manufacturing system. Fuzzy rules are constituted from MFs and a FIS is shaped by fuzzy rules. Gaussian membership functions were employed to identify the fuzzy variables in this study. The optimal number of fuzzy rules was considered to be equal to the number of clusters. These fuzzy rules were employed in the inner loop of ANFIS to fine-tune the fuzzy model and obtain the crisp outcomes of MRR. The cluster radius indicates the range of influence of a cluster when one considers the data space as a unit hypercube. In this approach, specifying a small cluster radius will usually yield too many small clusters in the data which will result in many complications (Jang et al., 1997). Specifying a large cluster radius will usually yield a few large clusters in the data which will result in fewer rules. On the other hand, additional number of MFs does not improve the efficiency of the model. The containment of each data to each cluster is defined by a membership function and the degree of membership. The BPNN algorithm was trained based on the exper- imental data of EDM process, by modifying the appropriate param- eters until the outcomes of neural network was close to the actual outcomes. As it is given in Eq. (5), on the basis of gradient descent method for minimization of the error, the correction increments of weighting coefficient were defined to be proportional to the slope related to the changes between the error estimator and the weight- ing coefficients. The weights of algorithm were determined from the input data set. A total of 51 training data and 30 checking data for the EDM process were uniformly sampled from the input ranges and used. The training errors were determined for different number of clusters and the squash factors were presented in Table 5. The error is the difference between the training data outcomes, and the outcomes of the fuzzy inference system corre- sponding to the input parameters. The BPNNs algorithm optimizes the learning procedure by propagating the error to improve the robustness of the fuzzy inference system. Fig. 5 presents the training error of different ANFIS models for different epochs and membership functions, when the ANFIS network accomplished desired training level.

Fig. 8 depicts the fuzzy reasoning procedure for a first order Sugeno fuzzy model. Fuzzy reasoning also known as approximate reasoning which is an inference procedure deriving conclusions from a set of fuzzy if-then rules and known facts. Fuzzification is the initial state of FIS and is defined as mapping from observed inputs to fuzzy sets in a specified input universe of discourse. A fuzzy set is completely characterized by its MF. A more convenient and concise way of defining a MF is to express it as a mathematical equation. In this study, Gaussian MFs were employed to specify the degrees of membership. Gaussian membership functions are spec- ified by two parameters (c, r), where c represents the MFs center and r determines the MFs width. Fig. 7 plots the Gaussian MFs for the system parameters of ‘pulse current, pulse on time and pulse off time and a singleton MF for the electrode material. If the component ‘pulse on time’ is considered as a fuzzy variable, its membership functions might be {short, medium, and long}. Similarly, the MF of ‘pulse current’ and ‘pulse off time’ are {very low, low, moderate, high, and extremely high} and {short, medium, and long}, respectively. The general illustration of a Gaussian MF is given in Eq. (11). The mathematical equation of MF for the fuzzy linguistic term ‘average’ was constituted as presented in Eq. (12).

Þ2

ð

rð Þ ¼ e(cid:2)1=2 x(cid:2)c

ð11Þ

Gaussian x; c; r

The network under consideration is an ANFIS and has only one overall output. The learning rule specifies how the parameters should be updated to minimize a prescribed errors measure. The rules are mathematical expressions constituting the backbone of FIS and determining the discrepancy between the network’s actual output and desired output. The designed ANFIS Architectures for the EDM process was presented in Fig. 6. It is composed of four nodes in input layer, 21 nodes (H1 (cid:4) H21) in hidden layer and one node (MRR) in the output layer. As it is given in Fig. 6, the inputs of the networks are pulse current, pulse off time, pulse on time

(

for xh100 and xi200

2

lðpulse on timeÞ ¼ lmoderate ¼

0 e(cid:2)1=2ðx(cid:2)150 25 Þ

for 100 (cid:5) x (cid:5) 200

35 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

ð12Þ

Table 5 The training error of ANFIS model for different epochs.

the membership degree

is 0.726, lpulse

on

51 1.25 19 1.425 21 1.726 6 1.834 Number of clusters Squash factor Epochs 9 1.525 Error (%)

Similarly, the membership functions of the other fuzzy terms were constituted and the degree of membership was calculated for all the parameters. For instance, when the pulse on time is 150 ls, time (150) = 0.726. When the pulse on time is 180, the membership degree of 180 ls for the fuzzy term ‘moderate’ is 0.487. A similar approach can be used to specify numerically the membership

0 1000 2000 3000 4000 1.2400 0.3924 0.3523 0.3401 0.3124 0.3640 0.3440 0.3356 0.3320 0.3030 1.2500 1.5100 0.9800 0.9000 0.8972 0.3500 0.3034 0.3029 0.3024 0.3021 4.000 3.326 3.100 2.942 2.858

36 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

Fig. 5. Plot of training error of ANFIS models with different MFs.

where A and B are fuzzy sets in the antecedents, while zn = fn(x1, x2,...,xm) is a crisp function in the consequent. fn(x1, x2,...,xm) is usu- ally a polynomial in the input variables (x1, x2,...xm), but it can be any function as long as it approximately describes the output of the fuzzy model within the fuzzy region specified by the antecedent of the rule. Some sample fuzzy rules in the case of multi-input sin- gle output EDM process for MRR (mm3/min) prediction can be pre- sented as follows;

{graphite = 1,

copper = 0,

for

aluminium = (cid:2)1}

degree of each observation by fuzzy linguistic term. As Gaussian MFs were employed in this study, for any specific observation (xi), a crisp value might be mapped to the fuzzy set with a degree of membership l(xi). Apparently, the universe of discourse of elec- trode materials is discrete and contains non-ordered objects. A fuzzy set whose support is a single point in a universe of discourse with l(xi) = 1 is called singleton. The fuzzy set of electrode materi- als; graphite, copper and aluminium was described with single val- their ues, characterization.

Rule1: IF ‘Pulse current (x1) is low (5A)’ AND ‘Pulse off time (x2) is short (50 lm)’ AND ‘Pulse on time (x3) is long (100 lm)’ AND ‘Electrode Material (x4) is Graphite (1)’ THEN The MRR is 13.5 (mm3/min).

A fuzzy rule set is characterized by the collection of fuzzy ‘If- then’ rules. The collection of fuzzy rules characterizes the simple input–output relation of the manufacturing system. The Sugeno fuzzy model offers a systematic approach to generate fuzzy rules from a given input–output data set. A typical fuzzy rule in a Sugeno model has the following form;

Rule 2: IF ‘Pulse current is moderate (10 A)’ AND ‘Pulse off time is medium (75 lm)’ AND ‘Pulse on time is medium (150 lm))’ AND ‘Electrode Material is Copper (0)’ THEN The MRR (z2 = 0.515x1 - (cid:2) 0.0161x2 (cid:2) 0.0074x3 + 13.2) is 16 (mm3/min).

If x1 is A AND x2 is B; . . . ; Then zn ¼ f nðx1; x2; . . . ; xmÞ

ð13Þ

¼ anx1 þ bnx2 þ cnx3 þ . . . knxm þ rn

Rule 3: IF ‘Pulse current is extremely high (15A)’ AND ‘Pulse off time is short (50 lm)’ AND ‘Pulse on time is long (200 lm)’ AND ‘Electrode Material is Aluminium ((cid:2)1)’ THEN The MRR is 15.9 (mm3/min).

Fig. 6. The architecture of ANFIS model for EDM process MRR.

37 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

Fig. 7. The fine tuned MFs for the input parameters of the EDM process.

Fig. 8 shows the fuzzy reasoning procedure for a first order Sugeno fuzzy model. For example, to obtain a MRR of 16 (mm3/ min), Rule2 (R #2) was fired. The input parameters of this rule is that the pulse current is equal to 10, pulse off time is equal to 75, the pulse on time is equal to 150 and the electrode material

is copper. The fuzzy reasoning procedure shows that each rule has a crisp output, the overall output (Z) is obtained via weighted average. The weighted average operator is sometimes replaced with the weighted sum operators (zn) presented in Eq. (13). The resulting input–output surface (given in Fig. 4), and the plots of

Fig. 8. Fuzzy reasoning procedures for an EDM process.

the MFs (given in Fig. 7) elucidate the complexity of the modelled EDM process, and show that it could be specified by the output equation of a fuzzy rule. The fuzzy antecedent part and the linear equation of Rule #2 were presented above. The first order outputs of the developed Sugeno fuzzy model are expressed in Eq. (14).

ð14Þ

z2 ¼ 0:515 x1 (cid:2) 0:0161 x2 (cid:2) 0:0074 x3 þ 13:2

z3 ¼ 0:561x1 (cid:2) 0:061x2 þ 12:7

the MRR increases. As per the interdependence between the elec- trode material and the pulse on-time, the pattern that was recog- nized with the data that were generated based on the ANFIS model was similar to the one that was observed in the interaction plot of Fig. 2 (b). However, despite the similarity of the patterns associated with the two models, it is clear that the rates of change that they exhibited are different. Consequently, the predicted val- ues rendered by these models are different and need to be compared.

z4 ¼ 0:561x1 (cid:2) 0:061x2 þ 14:3

4.3. Comparing the performance of developed models

z5 ¼ (cid:2)0:0061x2 (cid:2) 0:0034x3 þ 26:2

z9 ¼ 0:515x1 þ 12:2

z10 ¼ (cid:2)0:0036x3 þ 17:9

z11 ¼ 0:515x1 þ 12:2

z15 ¼ (cid:2)0:0036x3 þ 17:9

In this study, five ANFIS models together with a polynomial one were developed. The ANFIS models differ in their number of rules which were 6, 9, 19, 21 and 51 rules. Based on the training error, the models that have 9 and 21 rules were the best among the examined ANFIS models. The performance of the developed mod- els was assessed and compared based on the average predictive error and the residual ranges. The average prediction error was computed using Eq. (16).

Average Prediction Error% ¼

PjExperimental value (cid:2) Predicted valuej P Experimental value

(cid:3) 100

ð16Þ

The averages of the prediction error associated with the ANFIS models with 9 and 21 rules were respectively 9.1% and 1.55% whereas the average error of the polynomial model was 10.45%. Clearly, the ANFIS models with 9 and 21 rules outperformed the polynomial model.

Regarding the firing strength of each rule, it is important to note that not all the rules were fired to obtain MRR of 16 (mm3/min) when the inputs of pulse current, its on-time and its off-time were respectively, 10, 150 and 75 and the electrode material was copper. In fact, the firing strength of rule#2 was 1, hence, w2 = 1. The firing strengths of the other rules were as follow: w3 = w4 = w5 = w9 = w10 = w11 = w15 = w17 = w18 = w20 = 0. The weighted sum of linear functions in the consequent is always crisp, the overall crisp output (Z) was determined as the weighted average of each fired rule’s output. Hence, when the pulse current (x1) is equal to 10, pulse off time (x2) is equal to 75, pulse on time (x3) is equal to 150 and the electrode material is copper (rn), the overall outcome is calcu- lated by Eq. (15).

ð15Þ

Z ¼ w1z1 þ w2z2 þ . . . þ wnzn

Z ¼ MRR ¼ 1ð0:515x1 (cid:2) 0:0161x2 (cid:2) 0:0074x3 þ 13:2Þ

þ 0ð0:561x1 (cid:2) 0:061x2 þ 12:7Þ þ . . . . . . þ 0ð(cid:2)0:0036x3 þ 17:9Þ ¼ 16ðmm3=minÞ

The residuals of each of the developed models were obtained by computing the difference between the experimental and the pre- dicted values of MRR as shown in Fig. 10. The residuals ranges from (cid:2)6.5 to 6 for the polynomial models, whereas the ranges were respectively (cid:2)0.73 to 3.71 and (cid:2)9.83 to 6.39 when the ANFIS mod- els with 21 and 9 rules were applied. Being associated with nar- rower residuals range, the polynomial model was marginally better than the ANFIS model with 9 rules; however, the ANFIS model with 21 rules was, by far, the most accurate model for pre- dicting MRR among the examined ones.

The developed ANFIS model can also be used to explain the relationship between the MRR and the process parameters in the same manner that was adopted in interpreting this relationship when the polynomial model was used. To do so, consider the plots of the current main effect and the interaction between pulse on- time and electrode materials which were constructed based on the ANFIS model with 21 rules and depicted in Fig. 9 ((a) and (b)). As was the case in the ANOVA plot, MRR exhibited a linear relationship with the current indicating that as current increases

Linear fits between the experimental data and the predicted MRR values based on the three best models were also developed to assess the degree of correlation between the actual and predicted values as shown in Fig. 11. Noticeably, the ANFIS model with 21 rules outper- formed the other two models. In fact, the coefficient of correlation between experimental MRR data and the predicted MRRs of the polynomial and the ANFIS model with 9 rules was 0.97. However, this value was 0.99 in the case of the ANFIS model with 21 rules s.

38 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

Fig. 9. Main effect plot of pulse current (a) and interaction of pulse on-time with electrode material (b) based on ANFIS model.

was smaller than the value obtained based on the polynomial one (9.2%), the superiority of the ANFIS model in approximating the relationship between the MRR and the EDM process parame- ters was confirmed.

39 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

Manifestly, the performance of the latter model in predicting the MRR was exceptionally better than the other examined models.

To assess the performance of the examined models in terms of predicting MRR based on new parameter values, a confirmation experiment was performed using new combinations of the factor levels and the results are displayed in Table 6. Also given are the predicted values obtained based on the ANFIS model with 21 rules and the polynomial model together with the error values. These were used to estimate the average prediction error of each model. As the error percentage associated with the ANFIS model (5.07%)

A final noteworthy aspect of evaluating the performance of the developed models pertains to their extrapolation capabilities. Besides its superiority in terms of prediction error and the correla- tion between experimental and the predicted MRR values, the developed ANFIS model outperforms the polynomial approxima- tion in the ability to predict MRR values based on factor levels that lie beyond the originally considered range of the quantitative parameters. To elucidate this, it is necessary to reaffirm that the experimental data were divided into training and checking sets and that the selection was performed randomly so that no bias affects the allocation of a specific part of the data to any of the two divisions. Consequently, the checking data were likely to incor- porate input parameter values that lie outside the range of the input values that were included in the training set. Considering that the training algorithm was implemented on the training set and that its performance was assessed based on the error associated with the checking data set which involved new parameter values, it can be concluded that the ability to extrapolate is an essential requirement, based on which, the ANFIS model was developed and evaluated. This is surely not the case with the polynomial mod- elling where the approximation is local and limited to the region of the parameter values used in developing the polynomial function. A relevant warning in this context is that of Rawlings, Pantula, and Dickey (1998) who stated that ‘‘Extrapolations (beyond the X-space

Fig. 10. Residuals of the polynomial and ANFIS with 9 and 12 rules models.

Fig. 11. Linear fits between experimental MRR and its predicted values.

Table 6 The predicted ANFIS MRR outcomes and their errors for EDM process. Error Error Pulse current (A) Electrode material Actual MRR outcomes (mm3/min) ANFIS MRR outcomes (mm3/min) Polynomial MRR outcomes Pulse off time (ls) Pulse on time (ls)

7 12 7 12 7 12 7 12 7 12 7 12 75 75 75 75 75 75 75 75 75 75 75 75 140 140 190 190 140 140 190 190 140 140 190 190 Graphite Graphite Graphite Graphite Copper Copper Copper Copper Aluminium Aluminium Aluminium Aluminium 33.91 35.01 29.63 29.36 16.96 20.16 5.81 7.91 18.25 18.74 18.36 12.19 32.51 36.57 28.95 30.28 15.34 19.42 4.49 7.44 18.10 19.50 21.14 12.10 1.4 (cid:2)1.49 0.73 (cid:2)0.84 1.66 0.76 1.32 0.51 0.15 (cid:2)0.76 (cid:2)2.74 0.09 33.97 36.54 26.99 29.57 16.37 18.95 6.69 9.27 14.49 17.06 14.59 17.16 0.18 4.37 1.21 0.72 (cid:2)3.48 (cid:2)6.00 15.14 17.19 (cid:2)20.62 (cid:2)8.96 (cid:2)20.53 40.77

References

40 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

Abbas, M. N., Solomon, D. G., & Bahari, F. M. (2007). A review on current research trends in electrical discharge machining (EDM). International Journal of Machine Tools and Manufacture, 47, 1214–1228. Antony, J. (2003). Design of experiments for engineers and scientists. Butterworth- Heinemann.

Armendia, M., Garay, A., Iriarte, L. M., & Arrazola, P. J. (2010). Comparison of the machinabilities of Ti6Al4V and TIMETAL (R) 54M using uncoated WC–Cotools. Journal of Materials Processing Technology, 210, 197–203.

Aspinwall, D., Soo, S., Berrisford, A., & Walder, G. (2008). Work-piece surface roughness and integrity after WEDM of Ti–6Al–4V and Inconel 718 using minimum damage generator technology. CIRP Annals-Manufacturing Technology, 57, 187–190.

Bergquist, B., Vanhatalo, E., & Nordenvaad, M. L. (2011). A Bayesian analysis of unreplicated two-level factorials using effects sparsity, hierarchy, and heredity. Quality Engineering, 23(2), 152–166.

of the data) are always dangerous but can become disastrous if the equation is not a reasonably correct representation of the true model. Any extrapolation carries with it the assumption that the correlational structure observed in the sample continues outside the sample space. Validation and continual updating are essential for equations that are intended to be used for extrapolation.’’ A fur- ther evidence of the risk associated with extrapolation when using polynomial models relates to the standard deviation (standard error) of any predicted value, which is a quadratic function of the deviation of each input parameter value from its average; and thus increases exponentially as the values of the input parameter move away from their average. Consequently, the prediction of any response value that corresponds to an input parameter level falling beyond the originally considered region is expected to have a signif- icantly larger error than those which lie within the investigated space and have closer values to the input parameters average (Draper & Smith, 1998).

Caydas, U., & Hascalik, A. (2008). Modeling and analysis of electrode wear and white layer thickness in die-sinking EDM process through response surface methodology. The International Journal of Advanced Manufacturing Technology, 38, 1148–1156.

The above discussion revealed the superiority of the developed ANFIS model over the polynomial function and unveiled the level of accuracy and generalizability improvement that can be achieved by adopting this technique in the empirical approximation of the MRR associated with the investigated EDM process.

5. Conclusions

Che-Haron, C., & Jawaid, A. (2005). The effect of machining on surface integrity of titanium alloy Ti–6% Al–4% V. Journal of Materials Processing Technology, 166, 188–192. Cook, R. D. (1979). Influential observations in linear regression. Journal of the American Statistical Association, 74, 169–174. Daniel, C. (1959). Use of half-normal plots in interpreting factorial two-level experiments. Technometrics, 1, 311–341.

Due to their complexity, the task of modelling EDM processes is predominantly accomplished by means of empirical or semi- empirical modelling methods. Conventional statistical models and soft computing algorithms are broadly the major approaches adopted in the literature. Moreover, most of the reported model- ling studies limited their scope to the examination of quantitative parameters i.e. ones with numerical levels. In fact, modelling cate- gorical parameters has been accorded a scant attention.

International supplies. Dhar, S., Purohit, R., Saini, N., Sharma, A., & Kumar, G. H. (2007). Mathematical modeling of electric discharge machining of cast Al–4Cu–6Si alloy–10wt.% SiCP composites. Journal of Materials Processing Technology, 194, 24–29. Draper, N., & Smith, H. (1998). Applied regression analysis. New York: Wiley. Ezugwu, EO., Bonney, J., Da Silva, R. B., & Cakir, O. (2007). Surface integrity of finished turned Ti–6Al–4V alloy with PCD tools using conventional and high pressure coolant Journal of Machine Tools and Manufacture, 47, 884–891. Ezugwu, E., & Wang, Z. (1997). Titanium alloys and their machinability – A review. Journal of Materials Processing Technology, 68, 262–274. Hartung, P. D., Kramer, B., & Von Turkovich, B. (1982). Tool wear in titanium machining. CIRP Annals-Manufacturing Technology, 31, 75–80. Ho, K., & Newman, S. (2003). State of the art electrical discharge machining (EDM). International Journal of Machine Tools and Manufacture, 43, 1287–1300.

Hood, R., Lechner, F., Aspinwall, D., & Voice, W. (2007). Creep feed grinding of gamma titanium aluminide and burn resistant titanium alloys using SiC abrasive. International Journal of Machine Tools and Manufacture, 47, 1486–1492. Ishibuchi, H., Nii, M., & Turksen, I. (1998). Bidirectional bridge between neural networks and linguistic knowledge: Linguistic rule extraction and learning from linguistic rules. In Fuzzy systems proceedings, the ieee international conference on computational intelligence (pp. 1112–1117). IEEE.

This study reports the results of an EDM experiment conducted on the Ti–6Al–4V alloy. Its aim was to model the relationship between the MRR and process’s parameters using ANFIS and poly- nomial modelling approaches. Regarding the latter, two numerical parameters (current and pulse on-time) were declared significant in the ANOVA together with the electrode material and its interac- tion with pulse on-time. The factors levels were coded so that the linear and quadratic impacts of the numerical factors and their interactions with the two degree of freedom components of the categorical parameters were captured effectively. The R2-adjusted of the developed polynomial model was 0.931 implying that only 6.83% of the total variability in MRR was left unexplained.

Jang, J., Sun, C., & Mizutani, E. (1997). Neuro-Fuzzy and soft computing, New Jersey. Kelley, T. L. (1935). An unbiased correlation ratio measure. Proceedings of the National Academy of Sciences of the United States of America, 21, 554–559. Kumar, S., Singh, R., Batish, A., & Singh, T. (2012). Electric discharge machining of Journal of Machining and International titanium and its alloys: A review. Machinability of Materials, 11, 84–111. Li, X., Sudarsanam, N., & Frey, D. D. (2006). Regularities in data from factorial experiments. Complexity, 11(5), 32–45.

Luis, C., Puertas, I., & Villa, G. (2005). Material removal rate and electrode wear study on the EDM of silicon carbide. Journal of Materials Processing Technology, 164, 889–896.

Lütjering, G., & Williams, J. C. (2007). Titanium. Berlin/Heidelberg: Springer Verlag. Markopoulos, A. P., Manolakos, D. E., & Vaxevanidis, N. M. (2008). Artificial neural network models for the prediction of surface roughness in electrical discharge machining. Journal of Intelligent Manufacturing, 19, 283–292.

McGeough, J. (1988). Advanced methods of machining. New York: Chapman and Hall. Moiseev, V. (2005). Titanium in Russia. Metal Science and Heat Treatment, 47, 371–376.

Montgomery, D. (2010). Design and analysis of experiments. New York: Wiley. Panda, D. K., & Bhoi, R. K. (2005). Artificial neural network prediction of material removal rate in electro discharge machining. Materials and Manufacturing Processes, 20, 645–672.

Pradhan, M. K., & Biswas, C. K. (2010). Neuro-fuzzy and neural network-based prediction of various responses in electrical discharge machining of AISI D2 steel. The International Journal of Advanced Manufacturing Technology, 50, 591–610.

Five ANFIS models with 6, 9, 19, 21 and 51 rules were developed. In so doing, first order Sugeno fuzzy approach was adopted in con- junction with back-propagation neural networks training algo- rithm. The data were divided into training and checking sets. The former included 51 observations which served in model building while the latter involves 30 observations and was used for the pur- poses of checking and validating the developed models. Of the five developed ANFIS models, the ones with 9 and 21 rules were the best. The performance of these was compared with that of the poly- nomial model. While the ANFIS model with 9 rules had a compara- ble performance with the polynomial model, the one with 21 rules outperformed both of them based on prediction error, residuals range and the correlation coefficient of the experimental and pre- dicted MRR values. The superiority of the ANFIS model was verified based on a confirmation experiment that was conducted based on new combinations of the input parameters values. This can be attributed to the flexible learning capability that ANFIS models pos- sess which can also be advantageous for extending the use of such approach from modelling to controlling EDM processes thereby paving the way for interesting future research topics.

Puertas, I., Luis, C., & Alvarez, L. (2004). Analysis of the influence of EDM parameters on surface quality, MRR and EW of WC–Co. Journal of Materials Processing Technology, 153, 1026–1032. Rawlings, J., Pantula, S., & Dickey, D. (1998). Applied regression analysis: A research tool. New York: Springer.

Ross, P. J. (1996). Taguchi techniques for quality engineering. New York: McGraw-Hill. Sahoo, P., Routara, B. C., & Bandyopadhyay, A. (2009). Roughness modeling and optimization in EDM using response surface method for different work piece materials. International Journal of Machining and Machinability of Materials, 5, 321–346.

41 K. Al-Ghamdi, O. Taylan / Computers & Industrial Engineering 79 (2015) 27–41

Singh, R., & Khamba, J. (2007). Investigation for ultrasonic machining of titanium and its alloys. Journal of Materials Processing Technology, 183, 363–367. Tsai, K. M., & Wang, P. J. (2001a). Comparisons of neural network models on material removal rate in electrical discharge machining. Journal of Materials Processing Technology, 117, 111–124.

Singh, S., Maheshwari, S., & Pandey, P. (2004). Some investigations into the electric discharge machining of hardened tool steel using different electrode materials. Journal of Materials Processing Technology, 149, 272–277. Tsai, K. M., & Wang, P. J. (2001b). Predictions on surface finish in electrical discharge machining based upon neural network models. International Journal of Machine Tools and Manufacture, 41, 1385–1403.

Whitcomb, P, & Oehlert, G. W. (2007). Graphical selection of effects in general factorials, 2007 fall technical conference of the American Society for Quality (ASQ) and the American Statistical Association (ASA). Jacksonville, FL. Wu, C., & Hamada, M. (2011). Experiments: Planning, analysis, and parameter design Taha, H. (2010). Operations research: An introduction. New York: Prentice Hall. Taylan, O. (2006). Neural and fuzzy model performance evaluation of a dynamic production system. International Journal of Production Research, 44, 1093–1105. Taylan, O., & Darrab, I. A. (2011). Determining optimal quality distribution of latex weight using adaptive neuro-fuzzy modeling and control systems. Computers & Industrial Engineering, 61, 686–696. optimization. New York: Wiley.

Yan, B. H., Chung, Tsai H., & Yuan Huang, F. (2005). The effect in EDM of a dielectric of a urea solution in water on modifying the surface of titanium. International Journal of Machine Tools and Manufacture, 45, 194–200. Taylan, O., & Karagözog˘lu, B. (2009). An adaptive neuro-fuzzy model for prediction of student’s academic performance. Computers & Industrial Engineering, 57, 732–741.

Zarepour, H., Tehrani, A. F., Karimi, D., & Amini, S. (2007). Statistical analysis on electrode wear in EDM of tool steel DIN 1.2714 used in forging dies. Journal of Materials Processing Technology, 187, 711–714. Trent, E. M., & Wright, P. K. (2000). Metal cutting. Boston: Butterworth-Heinemann. Tsai, Y. Y., & Masuzawa, T. (2004). An index to evaluate the wear resistance of the Journal of Materials Processing Technology, 149, electrode in micro-EDM. 304–309.