23 Meisai questions Y original high-quality paper explanation on second-hand sailing boats! (Including all crawled data + summary + model assumptions + problem analysis + model establishment and solution)

Table of contents

Overview of papers:

Summary:

Table of contents:

All data tables used:

First question:

Second question:

The third question:

Fourth question:


Hello everyone, the original essay on topic Y of this competition was completed last night, and now I will explain it to you.

The thesis has 59 pages in total, 7 pages for some revision instructions, 50 pages for the text, and 2 pages for the appendix
. The data unit of each company is different, and then the unit is unified, and then the tables must be integrated. . . In short, after completing the data collection, it was relatively smooth. The prediction accuracy of the training set was 98%, and the test set was as high as 88%. It is not easy for this question, which shows that the collected data is very reliable. The next few questions will be finished by doing effect analysis on the data. This article is a text explanation, and I will write it in a relatively brief way. For the description of the finished product and the detailed explanation video, etc., you can click the card at the bottom of this article


The reason why it is so long is because:
a lot of space in my thesis needs to be used to explain why I do this, basically it is to teach you how to do it step by step, and I also need to take care of everyone's level, so there will be some places that need to be written very It is cumbersome, and some intermediate processes are shown in detail, so you can delete it yourself.

This paper can guarantee originality and high quality. It is by no means a random reference to a lot of models and codes that are copied and pasted into garbage semi-finished papers that have no application to fool people.

OK, the text version is more troublesome, so I will mainly show the general process and results:

Overview of papers:

Summary:

Table of contents:

All data tables used:

First question:

Based on the prices listed for each sailboat in the spreadsheet provided, develop a mathematical model to account for them. Include any predictors you find useful. You can refer to other resources for other characteristics of a given sailboat (such as beam, draft, displacement, mast, sail, hull material, engine hours, sleeping capacity, head room, electronics, etc.) Regional economic data. Identify and describe all data sources used. Includes a discussion of the accuracy of price estimates for each sailing variant.

The first is to collect data. Here I spent a whole 8 hours overnight to crawl the official websites of various ship manufacturers to obtain the specific parameters of each model ship.

The final monomer characteristic data obtained:

Two-body:

Then crawl the economic data of each region:

Finally all summed up:

Start training, I use gbdt, the final accuracy is very high:

For each variant discuss:

Second question:

Use your model to account for the effect, if any, of region on listing prices. Discuss whether there are any consistent area effects across all sailing variants. Discuss the practical and statistical significance of any area effects.

Analyze whether it affects:

This is followed by area effect quantification:

Statistics:

Actual meaning:

Finish.

The third question:

Discuss your usability of modeling the geographic area provided and how it applies to the Hong Kong (SAR) market. Select an informative subset of sailboats from the spreadsheet provided, containing both monohulls and catamarans. Comparable list price data for this subset were found from the Hong Kong (SAR) market. Model the regional influence, if any, of Hong Kong (SAR) on the price of each sailboat in your subset. Are catamarans and monohulls the same effect?

Find Hong Kong data:

It is very difficult to find a subset and find its price in Hong Kong. I finally registered on a second-hand trading website and found the price:

Continue to do area effect analysis:

Analyze whether the effect is gdp, correlation analysis, and then single and double body difference analysis, just do a difference analysis:

Finish.

Fourth question:

It's interesting to find that you can just find some:

OK, the space is limited, the above is just a rough introduction of the graphic version, you may see it in a cloud,

Try to watch the detailed video version explanation, the finished product description and some free materials are also available, click on my personal card below:

Guess you like

Origin blog.csdn.net/smppbzyc/article/details/129910017