papparametric osv design studies – precision and quality assurance via updated statisticser

Upload: ali-ebrahimi

Post on 06-Mar-2016

11 views

Category:

Documents


0 download

DESCRIPTION

More accurate parametric estimations during the conceptual phase are of paramount importance for proper early ship designAli Ebrahimi1, Per Olaf Brett et al

TRANSCRIPT

  • Parametric OSV Design Studies precision and quality assurance via updated statistics

    Ali Ebrahimi1, Per Olaf Brett1,2, Henrique M. Gaspar3 , Jose Jorge Garcia1, yvind Kamsvg1,3

    ABSTRACT

    More accurate parametric estimations during the conceptual phase are of paramount importance for proper

    early ship design. Balancing vessel designs means ensuring that a vessel particulars are set to represent a

    conceptual design format, which will meet the performance yield expectations of involved stakeholders at an

    earliest possible stage of the design process. This paper addresses how different early rule of thumb

    approaches to offshore support vessel design solution balancing have been out of date, or rather outdated,

    regarding quality assurance and statistical accuracy. The paper suggests ways of handling such a quality

    assurance issue by introducing and integrating parametric OSV design studies with multivariate data

    analysis MDA techniques to properly extract the more accurate estimating transfer functions in the earlier

    stages of the vessel design process.

    KEY WORDS

    Offshore Support Vessel Design, Parametric Vessel Design Analysis, Quality assurance of regression equations

    INTRODUCTION PARAMETRIC OSV DESIGN

    The objective of the parametric design procedure is to establish a consistent parametric description of the vessel in the early

    stages of vessel design, starting from the basic design principle that a certain description of a vessel should be able to perform

    efficiently a given mission (Parsons 2004; Gaspar 2013; Gaspar and Erikstad, 2009). Our paper discusses recent studies of such

    methods applied to the design of offshore support vessels (OSVs).

    The early stages of ship design, requires a consistent definition of a candidate base design in terms of just its main dimensions

    and other shape factors. This description can be then optimized with respect to some measure(s) of merit or subjected to various

    parametric trade-off studies to establish the basic definition of the design to be developed in more detail. In determining the

    main dimensions for a new ship, guidance can be taken from a similar ship for which basic details are known. This is known

    as a base vessel and must be similar in type, size, speed and power to the new vessel, which is constantly referred to as the

    new design is being developed (Watson 1977).

    Parametric models already exist within the marine design literature for common class of vessels such as Watson and Gilfillan

    (1998), for commercial ships; Nethercote and Schmitke, for SWATH vessels ; Fung for naval auxiliaries and Schneekluth and

    Bertram parametric models (Parsons, 2004). Any design models from the literature are, however, always subject to

    obsolescence as transportation practices, regulatory requirements, and other factors evolve over time (Watson, 1977).

    Considering current literature review and naval architecture books they still shows there is lack of updating of comprehensive

    parametric models for initial design of different OSV segments, with punctual exceptions such as Erikstad and Levander (2012).

    Our work demonstrates how utilization of different Multivariate Data Analysis techniques can be useful to extract more accurate

    parametric design knowledge from available OSVs data sources. To produce knowledge, different relevant data analysing and

    statistical data mining methods are applied. This paper contains some results of parametric studies on an Ulstein internal data

    set which is an integrated data base from most recent updated IHS Fairplay (IHS Fairplay, 2014), Marine Base (IHS Petrodata,

    2014) and Construction Vessel Base (IHS Petrodata, 2014), data bases for OSVs and OCVs.

    1 Ulstein International AS, Norway 2 Norwegian University of Science and Technology, Marine Systems Group, Norway 3 Aalesund University College, Faculty of Maritime Technology and Operations, Norway

  • A significant portion of collated data from these resources are raw data which has not been subjected to processing or any other

    manipulation, containing errors and missing values, not validated in different (colloquial) encoded formats, and we suspect

    the confirmation or citation (Barrass, 2004). Information used in all further parametric design calculation is the preface of a

    consistent and more robust result, which shows the importance of where this information comes from, how accurate are the

    values and how reliable are the sources. Due to the importance of utilizing appropriated data to produce conceptual vessel

    designs, it is essential to clean these data. Data cleaning is the process of detecting and correcting (or removing) corrupt or

    inaccurate records from a record set, table, or database, which refers to identifying incomplete, incorrect, inaccurate or

    irrelevant parts of the data and then replacing, modifying, or deleting this dirty data or coarse data. Data cleaning will increase

    data quality in the preprocessing stage, which will improve robustness of achieved parametric design knowledge. Data quality

    is a state of completeness, validity, consistency, timeliness and accuracy that makes data appropriate for a specific analysis.

    Multivariate statistics can extract better information and produce required knowledge for further designs and marketing

    strategies. Generated knowledge from this process is important for proper decision making processes to achieve better market

    based features of OSVs, which is calibrated according to statistical models of existing fleet and observed market trends.

    There are some differences between the presented approaches with traditional parametric methods, mainly regarding the sample

    size in a pre-set database, depth of utilization of statistics in the study process and multivariate aspects in data processing and

    regression models applied. Results of our study demonstrates that, to increase the robustness of parametric vessel design in a

    preliminary conceptual design phase, updated statistics of available world fleet is an appropriate source of information which

    will help designers to perform more accurate OSV design equations, compared to utilizing generally published design equations

    in naval architect books. For instance, Figure 1 demonstrates how the result of block coefficient (Cb Equations 1 to 4)

    estimation for the PSV- segment based on generally published equations, cannot straight forward be applied for Cb evaluation

    of OSVs. Therefore, developing specific equations based on nonlinear regression models can be useful to achieve more accurate

    and reliable results.

    = . Equation 1: Alexander

    = . + . . + . Equation 2: Gillfilan = . + . ! " # Equation 3: Japanese study, commented by Jensen = . + . %&. . '. + . %&. . '. Equation 4: Ulstein developed

    Obtained results due to practical OSV design experiences mainly reflects lower influence of Froude Number Fn in Cb

    estimation of OSV segment, which is shown in Figure 2. There are some reasons that justifies the result of different design

    routines of OSVs compared to commercial vessels. According to Ulstein internal design experience, lower proportion of middle

    parallel body to total length in addition to a lower speed range of OSV segment compared to other commercial vessels, have a

    vital role, which is observed in Cb and further displacement estimations. Lower parallel middle body in this segment makes

    better maneuverability and sea keeping behavior, which is essential in the operation of OSVs. However, it may reduce Cb of a

    vessel and as a consequence OSVs will have lower payload capacity compared to similar length size cargo carrying commercial

    vessels such as tankers.

    Figure 1: Cb calculated based on different equations Figure 2: Froude Number vs Cb different equations

  • We, therefore, state the importance of a deeper statistical multivariate study on OSV segments in conceptual design phase rather

    than solely relying on published literatures for commercial vessels in the same size ranges.

    MULTIVARIATE DATA ANALYSIS WITHIN OSV DATA Multivariate Data Analysis is providing a relevant statistical method to be able to extract proper knowledge from these sources

    of available data according to defined dependent and independent variables (Hair et al., 2010). Figure 3 presents the Ulstein

    process flowchart for applying multivariate data analysis in conceptual design phase of OSVs, based on the knowledge

    discovery from data base concept from Piatetsky-Shapiro (Fayad et al., 1996). The procedure consists of the following elements:

    i) Data base integration; ii) Data cleaning; iii) Clustering analysis; iv) Multivariate regression analysis; v) Parametric Equation

    development and parametric study of clusters; and vi) Data validity and sensitivity analysis.

    Multivariate Data Analysis MDA refers to any statistical technique used to analyze data that arises from more than one variable.

    This essentially models reality where each situation, product, or decision involves more than a single variable. Despite the

    quantum of data available, the ability to obtain a clear picture of what is going on and make intelligent decisions is a challenge.

    When available information is stored in database tables, Multivariate Analysis is used to process the information in a

    meaningful fashion. Applied multivariate data analysis methods utilized for making inferences regarding the mean and

    covariance structure of several variables, for modeling relationships among variables, and for exploring data patterns that may

    exist in one or more dimensions of the data (Timm, 2002).

    MDA analyses are typically applied in practical OSV design processe with N observations which constitute the world OSV

    segment fleet data base on p variables (vessel particulars). As with univariate data analysis, probability distribution of the

    Figure 3 - Multivariate parametric OSV design process

  • population is the multivariate normal (MVN) distribution. Multivariate techniques are used to classify or cluster objects into

    categories, via cluster analysis, classification and regression trees (CART), classification analysis and neural networks (Timm,

    2002). MDA helps to the detection and description of relationships among variables in large population spaces by categorizing

    objects into proper clusters.

    The most accomplished database used for OSV multivariate studies is an integration of different databases (based on IMO

    number and vessel name) in the way to achieve more complete and useful starting point information. Available data depends

    mainly on the vessel segment, since there exist general information for all segments such as delivery year, main dimensions,

    particulars, capacities and owners, besides specific information for each segment such as winch pull, survivors, J-lay tension

    and so on.

    Available OSV data sets, mostly contains huge amount of missing values and inaccuracies, varying in different segments.

    Observation in current vessel data, for instance in DWT part of PSV segment, shows 17% of missing values as well as around

    14% of inaccurate data. While in AHTSs 94% of vessels contains DWT information, around 25% of provided values are

    inaccurate. Generally main dimensions have highest accuracy and availability in data set with more than 90% of reliability for

    different segments while combination of availability and accuracy of data for main particulars such as DWT, deck area and

    installed power generally is not more than 75%. Vessel price and main machinery information available data for different

    segment generally does not exceed 50% of fleet. The density of wrong values and outliers in an original database will influence

    the final result and mislead design decisions. According to data quality literature, we are dealing with issues such as: Accuracy,

    Integrity, Cleanliness, Correctness, Completeness and Consistency (Dongre, 2004). Evaluating data quality enables to

    determine the usability of data and to establish the processes necessary for improving data quality to makes data appropriate

    for a specific use.

    A cleaning process can be carried out in automated, manual or combined format, which depends on the type and volume of

    data (Dongre, 2004). Practical experience shows, for our OSV parametric study, that performing combined data cleansing is

    the most effective way where logical error types in data structure are corrected through programmed cleansing process. Manual

    intervention is used to deal mainly with missing data and correcting outliers which are detected in automated process wherein

    neither a logical conclusion can be drawn nor rules can be formulated about the value that a particular field will take.

    There are several data cleaning methods that can be used, including Statistical, Clustering, Pattern-based and Association rules

    (Maimon, 2010). According to the nature of available data in the OSV-base all these methods are applied by Ulstein to clean

    up different objects. To identify outlier fields and records using the values such as mean, standard deviation, range is the first

    statistical step to find variation of each field in the data base. 1 or 1.5 standard deviation area in normally distributed dimensions

    and particulars is acceptable range for cleaning outliers. Dimensional ratios such as L/B, B/T and L/D are better cleaned up

    based on statistical methods, which requires no specific pattern and are usually normally-distributed around the average value.

    Figure 4 demonstrates examples of applying statistical cleaning on L/B ratios for OCV ship types.

    Figure 4: B/D ratio range over average Figure 5: DWT cleaning standard deviation from trend

    line

  • Finding patterns among datasets, which shows similar characteristics or behavior of variables, is another way of data cleaning

    data. Utilizing trend line in 2-D scatter plot will help to fill out missing data. For instance to determine missing DWT values

    for PSVs, regression line for DWT with Lbp*B*T can be a very useful cleaning object. The same procedure can be used to

    detect outliers and extremes from the standard deviation, coming up with a more uniform data set. Figure 5 presents an example

    of applying standard deviation on trend line for PSV segment larger than 2000 ton. The mentioned pattern is used to fill out

    missing values at an initial stage.

    Classification and statistical data grouping is another useful multivariate tool to clean data and detect more accurate patterns

    inside clusters. Data clustering is a data exploration technique that allows objects with similar characteristics to be grouped

    together in order to facilitate their further processing. Discriminant analysis is used to evaluate group separation and to develop

    rules for assigning observations to groups. Cluster analysis is concerned with group identification. The goal of cluster analysis

    is to partition a set of observations into a distinct number of unknown groups or clusters in such a manner that all observations

    within a group are more similar, while observations in different groups are less similar (Bischof, 1999). Identifying groups of

    individuals or objects that are similar to each other but different from individuals in other groups is intellectually satisfying and

    profitable (Dongre, 2004). Using initially, cleaned OSV data and implementing clustering analysis enables designers and

    analysts to find out statistically, similar mission requirements and functionalities where the target is the expectations in

    subgroups that are most likely to be representative for a certain market segment to be targeted in the analysis. Figure 6 is an

    example of applying cluster analysis on the world PSV fleet larger than 2000 tonnes (metric) DWT.

    Our internal Ulstein practice shows that one of the most effective clustering methods to group the vessels based on main

    particulars is density clustering. Density clustering groups aggregates elements that have statistically closer distance to the

    mean value of the group. Density clustering is originated based on K number of vessel designer specified clusters. The K initial

    centroids are chosen in parameters and each point is then assigned to the closest centroid. A case is assigned to the cluster for

    which its distance to the cluster mean is the smallest. The action performed in the algorithm centers around finding the K-

    means (Pelleg and Moore, 2000). Eventually each collection of points assigned to a centroid generates a descriptive cluster.

    Density clustering approach have been applied in parametric study of different OSV segments at Ulstein multivariate parametric

    design model. Continual results from studies on PSV larger than 2000 tonnes DWT is demonstrated in Figure 8 and Figure 9.

    To perform a clustering study and achieve more accurate results, we selected variables with lower internal correlation. DWT

    and Engine HP are selected for clustering study, where DWT as one of the main revenue earning factors of PSVs and engine

    power contributing to the capital costs and operational cost. These driver factors have significant role in the decision making

    process of this segment. On the other hand, significant influence of engine HP on the vessel speed could not be neglected in

    variable selection process. The result of extensive clustering and further statistics manipulation of the existing OSV fleet is

    depicted in Figure 6 where, after running some iterations, eventually six clusters are selected, which provides acceptable

    statistical results for each cluster. This study covers all PSV fleet larger than 2000 tonnes built after 1975. A second iteration

    contains vessels built after 2005, which covers 75% of large size PSVs. It is demonstrated that a substantial improvement since

    2005 in the OSV market is the case, therefor to reflect the influence of market change in parametric evolution of the fleet,

    vessels built after 2005 have been selected for the second part of the study.

    PSV

    180001600014000120001000080006000400020000

    7000

    6000

    5000

    4000

    3000

    2000

    Engine_HP_Total

    Dw

    t

    1

    2

    3

    4

    5

    6

    C42

    Scatterplot of Dwt vs Engine_HP_Total

    Figure 6: PSV fleet K-means cluster Figure 7: Distribution of generated

    cluster

  • We obtain a better understanding of existing trends among different clusters and finding similar groups of PSVs, based on

    average parameters by generating frequency histogram and presenting the most popular clusters among data populations. As

    observed in Figure 10, Clusters 2, 5 and 6 from clustering of the total fleet and clusters 1, 3 and 5 from clustering of vessels

    built after 2005, they have a higher population in the overall market sample. The next step is to find out similarity among

    extracted groups, via hierarchical clustering. Hierarchical clustering considers every case being a cluster in itself. However at

    successive steps, similar clusters are merged together and clusters are formed (Maimon, 2010). The algorithm loses information

    at every step, ending with every previous division back to one common cluster (Figure 11).

    cluster2-5cluster2-1cluster1-6cluster1-5cluster2-3cluster1-4

    100,00

    100,00

    100,00

    100,00

    Variables

    Sim

    ilari

    ty

    DendrogramSingle Linkage; Absolute Correlation Coefficient Distance

    Figure 10: Different clusters frequency histogram variation among clusters

    Figure 11: Hierarchical clustering of groups

    Figure 8: Main dimensions variation among

    clusters Figure 9: Main particulars variation among clusters

  • As it is observed in the Dendrogram (Figure 11), three categories of PSVs have higher population in the market, indicated in

    Table 1 by different colors. Considering characteristics of these three categories it is conspicuous that, due to size, only two

    groups of vessels are highly populated. LOA around 75 m and LOA around 89m. Third category is the result of higher Engine

    horsepower as it is shown in Table 1. Three cluster groups of PSVs with higher population, have been identified as shown in

    Table 2.

    Table 2: Analysis of the categories of PSVs larger than 2000 tonnes

    To eliminate the influence of time in cluster analysis all prices are adjusted to 2015, based on globally published historical data

    of industrial inflation rate (Bureau of Labor Statistics, 2014)

    The rule length constraint (89,9 m) in design explicitly demonstrated by the population of Cluster 2 and 3. Vessels larger than

    90m Loa should fulfil higher safety requirements, which increase building cost, so designer mainly attempt to define their new

    designs within this limitation, which has increased the population of large size PSVs around 89m.

    Category 2 and Category 3 clusters have similar size but a significant difference in installed power that increases considerably

    around 40%. That power difference is reflected in the final price, with an increment around 3 Mill USD.

    The cluster examples demonstrate how applying statistical classification methods will lead to better understanding of market

    behavior for designs and main particular decision making processes at initial conceptual design phase. Compared to traditional

    vessel classifying-methods, usually based on single variable such as vessel size or capacity, this method provides the possibility

    of considering more variables in a grouping procedure. The overall process of multivariate data clustering analysis is a statistical

    process, which is more robust compared to traditional classifying methods. For instance, in the case of the PSV fleet, if

    traditional vessel classification is applied based on vessel cargo carrying capacity, the results would not differentiate large size

    PSVs regarding the different power range, which influences the significant differences in price and operability of vessel as

    main decision making factors.

    MULTIVARIATE REGRESSION In OSV parametric design studies multiple linear regression is utilized to determine the most appropriate linear model to predict

    only one dependent random variable y from a set of fixed, observed independent variables x1, x2,..., xk measured without

    error. Different parameters, for instance DWT as dependent and main dimensions as independent variables, are considered.

    Linear model can be fitted using the least squares as an initial model to the data.

    Formal tests and numerous types of plots have been developed to systematically help one evaluate the assumptions of

    multivariate normality; detect outliers, select independent variables, detect influential observations and detect lack of

    independence. Single and multiple linear and nonlinear regression analysis, besides time series analysis and trend analysis of

    clusters and segments are applied in this part of multivariate parametric OSV study. In the following, we demonstrate some

    examples of applying multivariate regression analysis in different OSV segments. Some Ulstein OSV developed design

    equations are also commented.

    Built LOA (m) DWT (Tonnes) Deck Area (m2) Speed (knots) Power (HP) Price (M USD)

    Category 1 After 2005 75 - 78 3500 740 13 below 7000 23 - 27

    Category 2 After 2010 87 - 89 5000 970 13 7000 - 7900 43 - 45

    Category 3 After 2010 87 - 89 5000 970 13,5 10500 47

    Fig 10: Different clusters frequency Fig 11; Hierachical clustering of groups Table 1: Three main category PSVs larger than 2000 tonnes

  • Single regression analysis example: PSV, GT vs DWT

    Figure 12 shows that DWT and GT are highly correlated to each other. Payload capacity of vessels are considered based on

    these two important parameters. In addition to value of R-Square to evaluate significance of correlation among variables,

    residual plot and normality check of residuals is important in statistical study, which should be considered in regression based

    design estimations.

    Another example of applying regression modeling to identify dimensional ratios is shown in Figure 14, which presents ship

    shape OCV L/B ratio time series. As it is depicted for the OCV segment, L/B ratio has a negative trend in time series, which

    means newer designs have larger beam compared to older vessels. Main reason, which justifies such a trend, is high deck load

    capability and stability requirement of OCVs which convinces designers to have larger improvement in the beam of the vessels

    compared to length. However, similar studies for other dimensional ratios are possible.

    Figure 14: OCV L/B ratio time series study

    700060005000400030002000

    8000

    7000

    6000

    5000

    4000

    3000

    2000

    1000

    0

    Dwt

    Gt

    S 638.973

    R-Sq 61.9%

    R-Sq(adj) 61.8%

    Fitted Line PlotGt = 362.2 + 0.6004 Dwt

    + 0.000028 Dwt**2

    300015000-1500-3000

    99.99

    99

    90

    50

    10

    1

    0.01

    Residual

    Pe

    rce

    nt

    60005000400030002000

    2000

    0

    -2000

    Fitted Value

    Re

    sid

    ua

    l

    240016008000-800-1600

    200

    150

    100

    50

    0

    Residual

    Fre

    qu

    en

    cy

    1100

    100090

    080

    070

    060

    050

    040

    030

    020

    010

    01

    2000

    0

    -2000

    Observation Order

    Resid

    ual

    Normal Probability Plot Versus Fits

    Histogram Versus Order

    Residual Plots for Gt

    Figure 12: GT vs DWT regression Figure 13: Residual analysis of regression GT DWT curve

  • MULTIVARIATE NONLINEAR REGRESSION

    In addition to normal single regression analysis in 2-D plot, between two variables, multivariate regression is one the most

    powerful methods to generate proper equations for a particular estimation. There are many examples in ship parametric design

    which shows importance of utilizing multi regression rather than single regression which can miss lead the designer to wrong

    points and conclusions. Figure 15 shows significant correlation between length and DWT in PSV segment, which is statistically

    correct. However, it should be considered as it is demonstrated in Figure 16 and 17, there is a significant correlation between

    Beam and Length and Beam with DWT on the other hand, which means all influencing variables on the dependent variable

    (DWT) should be considered together and single regression is not a so precise tool for design estimation.

    Considering the impact of different dimensions, nonlinear multivariate regression analysis is applied on the development of

    DWT equation, since single variable regression analysis is deprived of having comprehensive consideration of all influential

    variables on dependent parameter. The result is considering the influence of L, B and T conjointly as main drivers for DWT,

    but with different correlation significance. Equation 5 depicts PSVs DWT estimation equation, which is calibrated to real vessel

    data and Figure 18 presents validation of the generated points based on DWT equation over Loa diagram for the PSV fleet

    larger than 2000 tonnes DWT.

    ()' = , +%&, , ', + , +%& ' Equation 5: PSV parametric equation for DWT

    The same procedure is applied to calculate other particulars including LWT, Engine power and Deck area for different OSV

    segments. Results after calibration with internal source of design data have a variance lower than 5% from real data, which is

    theoretically acceptable for early estimation processes.

    Sensitivity analysis and cost impact of small changes of each dimension on main particulars (DWT, LWT, power, deck area,

    building cost) is the last stage of parametric design which is significantly important for decision makers in early design phase.

    Figure 19 demonstrates sensitivity analyze on ship shape OCV segments for 10% change in beam, keeping length and draft

    Figure 15: PSV DWT vs Lbp Figure 16: PSV Beam vs Lbp Figure 17: PSV DWT vs Beam

    Figure 18: PSV DWT vs Loa, based on real data and developed equation

  • constant compared to base vessel. It is demonstrated that a 10% increment on Beam will have different impacts on DWT, LWT,

    Engine power and Deck area. Figure 20 shows the impact of changes on main dimensions of OCV segments on design

    particulars. Three steps of changes 5, 10 and 15% has been considered in this study.

    Figure 19: Example of sensitivity OCV study 10% beam

    Figure20: OCV particular sensitivity to L,B, T changes

    COMBINING PARAMETRIC STUDY AND MARKET ANALYSIS TOWARDS A DESIGN STRATEGY

    Due to the volatility of the OSV market, as well as the evolution of the OSV fleet through the last decades, it is important to

    consider, analyses and understand the reasons of evolutions, most influential parameters and their significance of influence

    which may impact final decision dramatically. A time history plot helps to identify external factors like: economic crisis, low

    oil price, new regulations, and relation of these with relevant changes in the tendency of the evolution.

    For the combination of parametric study with market analysis included, we have chosen, as example, the work performed for

    AHTS with more than 10.000 BHP. In Figure 21 relationship between building price and units delivered is reflected. It is

  • clearly shown in the figure that the vessels delivered in good market years (years with a high delivery ratio) have a higher cost

    than those delivered in bad years.

    On other hand, Figure 22 shows the evolution of the cost from two points of view: one where we connect the revenue indicators

    (bollard pull and deadweight, for AHTS) with building price (adjusted based on the evolution of industrial inflation rate (Bureau

    of Labor Statistics, 2014)) and other where revenue indicators are connected with cost driver indicators (as L*B*D and power).

    It is clear the opposite direction of both tendency lines.

    The negative tendency of the blue lines reflects that to get the same bollard pull and deadweight, it is required a higher inversion

    at the same time and from another point of view, it is required a smaller hull and less power to achieve that bollard pull and

    power combination when compared to older vessels. In another words, we can say that the efficiency of the vessels have

    increased, but this technological and efficiency improvement have also increased the price in recent years.

    The two figures above show in a time history plot, the evolution of non-dimensional ratios of AHTS with more than 10.000

    BHP. Figure 23 shows that it is possible to conclude that new regulationsas example, do not show any clear effect in the

    tendency lines (NMD, 2007).

    Figure 23: Evolution of non-dimensional ratios (AHTS)

    Figure 21: Building price and units delivered by year

    (AHTS) case)

    Figure 22: Evolution of economy ratios by year (AHTS

    case)

  • CONCLUSION

    In this paper we presented how the application of multivariate data analysis MDA in parametric design of OSVs can be useful

    during initial design stage. We defend that the proper utilization of statistical methods will help the designer to have better

    estimation of main particulars of final product based on main dimensions at initial design stage. It is shown by some case

    studies how the methodology is more accurate compared to available analytical equations, which is mainly developed for

    commercial vessels and not applicable in most cases for OSV segment. The methodology was contrasted and discussed in

    relation to merchant vessel design procedures and practices.

    The methodology introduced in this paper considers various MDA aspects to create meaningful knowledge from a source of

    row data, such as i) OSV fleet data refining (data cleaning); ii) Clustering methods and their applications; iii) linear and non-

    linear regression models; iv) data verification. Case studies demonstrated the application of methodology to find more precise

    (that is, better) equations for different OSV-segments. Proper vessel type segmenting and data-updating procedures discussed

    and recommendations proposed. Nonlinear regression equations are proposed for OSVs based on special characteristics of

    these groups of vessels. It shown in the paper how findings are verified with OSV fleet real data and specific Ulstein designs.

    The methodology is beneficial to reduce deviations from real vessel data, leading to earlier and faster design results with higher

    level of accuracy. Moreover, results claim for quality control and quality assurance reviews of existing published estimating

    equations for OSV fleet, together with a quantification of the level of deviations and accuracy validations among real data and

    results of past equations. By this paper our proposition is that an effective parametric vessel design must apply continually

    updated statistics and revised regression equations. Proper statistical methods and robust approximations must be applied in

    future parametric vessel design approaches to retain sufficient rigor in the analysis and vessel design work.

    REFERENCES

    BISCHOF, H., LEONARDIS, A. and SELB, A. "MDL principle for robust vector quantisation", Pattern Analysis and

    Applications vol. 2, p. 59-72, 1999.

    BARRASS. C.B., Ship Design and Performance For Masters And Mates, 2004 ISBN 0 7506 6000 7

    DONGRE, K., Data Cleansing Strategies. Information Management, InfoManagement Direct, October 2004October 2004

    ERIKSTAD, S.O. and LEVANDER, K., System Based Design of Offshore Support vessels, 11th IMDC, Glasgow, UK, 2012

    FAYYAD, U. PIATETSKY-SHAPIRO, G. AND SMYTH, P. From Data Mining to Knowledge Discovery in Databases, AI

    Magazine 17(3): Fall 1996, 37-54

    GASPAR, H. M. Parametric Ship Design: A Simple Application in HTML + Javascript, HIALS, 2013

    GASPAR, H.M., ERIKSTAD, S. Extending the Energy Efficiency Design Index to Handle Non-Transport Vessels.

    COMPIT09, Budapest, Hungary, 2009

    HAIR, J.F., ANDERSON, R.E., TATHAM, R.L. and BLACK, W.C., Multivariate Data Analysis, Seventh edition. 2010

    IHS Fairplay, IHS Fairplay, Q3 2014

    IHS Petrodata, Marine Base, Q3 2014

    IHS Petrodata, CVBase, 2014

    MAIMON, O., ROCKACH, L. Data Mining and Knowledge Discovery Handbook, 2010

    NMD (Norwegian Maritime Directorate), Guidelines for revision of ISM-manual on supply ships and tugs used for anchor

    handling, 2007

    PARSONS, M. G. 2004 Parametric Design, Ch. 11 in Lamb, T. (ed.), Ship Design and Construction, SNAME, Jersey City,

    NJ

    PELLEG, D. AND MOORE, A., "X-means: extending K-means with efficient estimation of the number of clusters" ICML

    2000.

    SCHNEEKLUTH, H. and BERTRAM, V., Ship Design for Efficiency and Economy, 2004 second edition ISBN 0 7506

    4133 9

    TIMM, N. H., Applied Multivariate Analysis, 2nd edition 2002 ISBN-13: 978-0387953472

    WATSON, D. G. M., and GILFILLAN, A. W. (1977) Some Ship Design Methods, Transactions RINA, Vol. 119,

    WATSON D. G. M., Practical ship design, Elsevier ocean engineering book series, Volume 1, 1998