[关闭]
@liushiya 2018-11-13T03:33:32.000000Z 字数 4847 阅读 488

XGBoost Experiment Manual

机器学习 实验


你可以点击这里查看中文版本。

Motivation of Experiment

Dataset

Dataset Download Link:
* Training set: TrainingData.csv.zip
* Valadation set: ValidationData.csv.zip

Dataset Context:

The UJIIndoorLoc database covers three buildings of Universitat Jaume I with 4 or more floors and almost 110.000m2. It was created in 2013 by means of more than 20 different users and 25 Android devices.The database consists of 19937 training/reference records and 1111 validation/test records.

  1. The 529 attributes contain the WiFi fingerprint, the coordinates where it was taken, and other useful information.Each WiFi fingerprint can be characterized by the detected Wireless Access Points (WAPs) and the corresponding Received Signal Strength Intensity (RSSI). The intensity values are represented as negative integer values ranging -104dBm (extremely poor signal) to 0dbM. The positive value 100 is used to denote when a WAP was not detected.
  2. During the database creation, 520 different WAPs were detected. Thus, the WiFi fingerprint is composed by 520 intensity values.
  3. Then the coordinates (latitude, longitude, floor) and Building ID are provided as the attributes to be predicted.The particular space (offices, labs, etc.) and the relative position (inside/outside the space) where the capture was taken have been recorded. Outside means that the capture was taken in front of the door of the space.
  4. This Dataset also include the information about who (user), how (android device & version) and when (timestamp) WiFi capture was taken is also recorded.

Dataset Content:

Experiment Environment

Experiment Form

Individual completion

Experimental procedure

Install and invoke XGBoost

  1. pip 安装:
  2. conda 安装:
    • (linux) conda install xgboost

Algorithm step

  1. Using 'pandas' or other packages to load the experiment dataset.
  2. Combining 'BuildingID' and 'Floor' into a unique property as the target of the forecast.
  3. Attributes 001 to 520 (WAP001-WAP520) are appropriately processed and converted into data acceptable to XGBoost as data input..
  4. Setting appropriate parameters for the XGBoost.
  5. Invoking the Python API of XGBoost to train.
  6. Predicting the result in the Validation set and requiring that the accuracy of result is above 90 percentage.

Grading standard

Rating item Proportion Description
Attendance 40% You can ask for leave from the college if special circumstances exist
Code is valid 20% Valid code means that there is no syntax error in the code
Experiment report 30% Mainly checking whether fill in the template of the experiment carefully
Code specification 10% Mainly checking whether name the variables standardly

提交方式

Submission process

  1. Visiting the website 222.201.187.50:7001
  2. Clicking on the corresponding submission entry
  3. Filling in your name, student number, and then uploading the report with the format of pdf and the code compression package with the format of zip.

Precautions


Reference documentation

https://xgboost.readthedocs.io/en/latest/parameter.html (Refer to the parameter description in the content of the website.)

Any comments or suggestions can be directly reflected in the QQ group to the assistants.

添加新批注
在作者公开此批注前,只有你和作者可见。
回复批注