Publication:
Examination of Distance Based Regression Methods for Different Data Structures in Animal Science

dc.contributor.authorÖnder, Hasan
dc.contributor.authorKurnaz, Burcu
dc.date.accessioned2025-12-11T01:44:48Z
dc.date.issued2025
dc.departmentOndokuz Mayıs Üniversitesien_US
dc.department-tempOndokuz Mayıs Üniversitesi,Ondokuz Mayıs Üniversitesien_US
dc.description.abstractDistance-based regression is an alternative method for parameter estimation in linear regression models when mixed-type explanatory variables are used. Distance-based regression is similar to classical linear regression, except that explanatory variables are measured by distance measures rather than raw values. In this study, datasets with sample sizes of 10, 25, 50, 100, 250 and 500 produced for Binomial, Normal, t, Chi-square and Poisson distributions of Euclidean, Gower and Manhattan distance measures and real data with discrete and continuous distribution that body weight at sixth months was used as outcome variable, body length and chest depth at sixth months of Saanen kids were used as explanatory variables as continuous data. Milk fat ratio was determined as the response variable, while the number of milking per day and the season of Polish Holstein Friesian cattle were determined as the explanatory variables as discrete data. It was aimed to determine the effect on the data sets (10, 50 and 100 sample sizes) by comparing the results obtained from the Linear Regression method. R packages \"dbstats\", \"cluster\" and \"tidyverse\" were used to perform the analysis. As a result, it has been determined that the use of Manhattan distance in data with Poisson distribution may produce unsuccessful results, especially in small sample sizes (n<50). Although there is no significant difference between Gower and Euclidean distances in different distributions according to sample sizes, it has been determined that the use of Euclidean distance measure in some distributions produces results that cause fluctuation. However, it has been understood that the Gower distance can be recommended as a more suitable choice since it has a more stable structure. For the applicability of the Least Square Estimation method, it may be recommended to use Distance Based Regression methods in cases where the necessary assumptions mentioned in this study cannot be met.en_US
dc.identifier.doi10.34248/bsengineering.1599606
dc.identifier.endpage362en_US
dc.identifier.issn2619-8991
dc.identifier.issue2en_US
dc.identifier.startpage354en_US
dc.identifier.trdizinid1304530
dc.identifier.urihttps://doi.org/10.34248/bsengineering.1599606
dc.identifier.urihttps://search.trdizin.gov.tr/en/yayin/detay/1304530/examination-of-distance-based-regression-methods-for-different-data-structures-in-animal-science
dc.identifier.urihttps://hdl.handle.net/20.500.12712/45802
dc.identifier.volume8en_US
dc.language.isoenen_US
dc.relation.ispartofBlack Sea Journal of Engineering and Scienceen_US
dc.relation.publicationcategoryDiğeren_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.titleExamination of Distance Based Regression Methods for Different Data Structures in Animal Scienceen_US
dc.typeOtheren_US
dspace.entity.typePublication

Files