A brief guide to using Spoon (ETL) --- new skills picked up while working overtime

-----------------Basic concepts------------- (actually taken from Baidu)-------------------------

ETL, short for Extract-Transform-Load, describes the process of extracting data from a source, transforming it, and loading it into a destination. The term is most commonly used in the context of data warehouses, but it is not limited to them.

ETL consists of three steps: 
Extraction: read data from the various original business systems. This is the prerequisite for all the other work. 
Transformation: convert the extracted data according to pre-designed rules, so that originally heterogeneous data formats are unified. 
Loading: import the transformed data into the data warehouse, incrementally or in full, according to the plan.
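
To make the three steps concrete, here is a minimal sketch in Python using an in-memory SQLite database. It is not how Spoon works internally; the table names (source_users, target_users), the columns, and the gender-code rule are all made up for illustration.

```python
import sqlite3

def extract(conn):
    """Extract: read raw rows from the (hypothetical) source table."""
    return conn.execute("SELECT id, name, gender FROM source_users").fetchall()

def transform(rows):
    """Transform: unify heterogeneous formats using pre-designed rules."""
    # Assumed rule: map free-form gender text to the codes M / F / U (unknown).
    gender_codes = {"m": "M", "male": "M", "f": "F", "female": "F"}
    return [
        (row_id, name.strip().title(), gender_codes.get(gender.strip().lower(), "U"))
        for row_id, name, gender in rows
    ]

def load(conn, rows):
    """Load: write the transformed rows into the target table (full load)."""
    conn.execute("DELETE FROM target_users")
    conn.executemany("INSERT INTO target_users VALUES (?, ?, ?)", rows)
    conn.commit()

def run_etl():
    """Build a tiny demo database, run extract -> transform -> load, return the result."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE source_users (id INTEGER, name TEXT, gender TEXT)")
    conn.execute("CREATE TABLE target_users (id INTEGER, name TEXT, gender TEXT)")
    conn.executemany(
        "INSERT INTO source_users VALUES (?, ?, ?)",
        [(1, " alice ", "female"), (2, "BOB", "M"), (3, "carol", "?")],
    )
    load(conn, transform(extract(conn)))
    return conn.execute("SELECT id, name, gender FROM target_users ORDER BY id").fetchall()

if __name__ == "__main__":
    print(run_etl())
```

The load step here is a full load (it clears the target first); an incremental load would insert only the rows that are new since the last run.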

-----------------------Basic operation ---------------- Brother SHIFT explained it for about 1h, very patiently ------ new skills I picked up while working overtime-----

(1) Spoon can be run right after extracting the archive

(2) After extraction, the startup scripts are:

  • Spoon.bat: runs Spoon on Windows.
  • Spoon.sh: runs Spoon on Linux, Apple OS X, and Solaris.
(3) Connect to the databases: the source database and the target database

(4) If the connection sometimes fails, add orai18n.jar and ojdbc6.jar (Oracle JDBC jars) to the lib folder

(5) Input and output

(6) Connect the input and the output to their respective databases and tables

  Double-click the database connection, fill in the connection details, test whether it connects, then confirm


(7) Drag the icon directly from the panel on the left onto the job on the right. A detailed walkthrough is at the URL below.

URL: http://wenku.baidu.com/link?url=1AfKapKbPGJVCHXHVz9A-eWfz3EjYNPZgFUYA-hEeRlxBO1kdSu8dZBcxYjLrm_mUYZOE4O19JyTiE4oalxqEQlcuOM40v4iMYnT02xeZza

In practice, many of the features go unused, and I have not really tried them myself. If you are interested, you can explore them.



Drag Table input and Table output directly onto the job, click Edit on each, and fill in:

output:

Output map:

(8) To connect the two, hold Shift and drag the mouse from one to the other


(9) If there are no problems, run the transformation... the data is transferred from table A to table B...


-------------------------Commonly used-------------------- --------------

(1) Database query

To be honest, I don't really know how to use it myself. Broadly, it is for looking up values that live in a different table.

For example: table A has only the id and no name, so the name is looked up in table C by id and inserted into B.
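
The A / C / B lookup described above can be sketched in Python with SQLite. This is only an illustration of the idea, not Spoon's implementation; the lowercase table names a, b, c and their columns are assumed, and rows with no match in c get an empty name.

```python
import sqlite3

def fill_b_with_names(conn):
    """For every id in a, look up its name in c and insert (id, name) into b."""
    conn.execute(
        "INSERT INTO b (id, name) "
        "SELECT a.id, c.name FROM a LEFT JOIN c ON c.id = a.id"
    )
    conn.commit()
    return conn.execute("SELECT id, name FROM b ORDER BY id").fetchall()

def demo_lookup():
    """Create the three hypothetical tables with sample rows and run the lookup."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE a (id INTEGER);             -- ids only, no names
        CREATE TABLE c (id INTEGER, name TEXT);  -- the lookup table
        CREATE TABLE b (id INTEGER, name TEXT);  -- the target table
        INSERT INTO a VALUES (1), (2), (3);
        INSERT INTO c VALUES (1, 'Alice'), (2, 'Bob');
        """
    )
    return fill_b_with_names(conn)

if __name__ == "__main__":
    print(demo_lookup())
```

A LEFT JOIN is used so that an id with no match in c still reaches b (with an empty name) instead of silently disappearing.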


(2) Transform > Value mapping: a one-to-one mapping between data dictionary values

Double-click the step to edit it.

(I got this wrong several times. In fact, many value mappings can be inserted into the A-to-B table transformation.)
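
A value mapping is essentially a dictionary lookup applied to one field of every row. A minimal sketch in Python, assuming rows are plain dicts; the field name "status" and the codes are invented for illustration and have nothing to do with Spoon's own configuration format.

```python
def map_values(rows, field, mapping, default=None):
    """Return copies of rows with row[field] replaced according to mapping.

    Values missing from the mapping become `default` if one is given,
    otherwise they are left unchanged.
    """
    mapped = []
    for row in rows:
        new_row = dict(row)  # copy, so the input rows are not modified
        value = row[field]
        if value in mapping:
            new_row[field] = mapping[value]
        elif default is not None:
            new_row[field] = default
        mapped.append(new_row)
    return mapped

if __name__ == "__main__":
    # Hypothetical data dictionary: source codes -> target codes.
    status_map = {"0": "inactive", "1": "active"}
    rows = [{"id": 1, "status": "1"}, {"id": 2, "status": "0"}, {"id": 3, "status": "9"}]
    print(map_values(rows, "status", status_map, default="unknown"))
```

Several such mappings can be chained, one per field, which matches the note above that many value mappings can sit in the same transformation.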

E.g:



-------------Struggled with this problem for hours ---------------

Spoon hit an error: it could not save, or it showed an error as soon as it was opened

Solution: go to the ~/.kettle directory, delete the shared.xml file (~/.kettle/shared.xml), and restart Spoon
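
If you would rather not delete the file outright, the same fix can be scripted so that shared.xml is moved aside as a backup instead. This is a cautious variant of the step above, not the exact fix from the forum thread; the function name and the .bak suffix are my own choices.

```python
from pathlib import Path

def reset_shared_objects(kettle_dir=None):
    """Move shared.xml aside so Spoon starts without it.

    kettle_dir defaults to ~/.kettle. Returns the backup path,
    or None if there was no shared.xml to move.
    """
    kettle_dir = Path(kettle_dir) if kettle_dir else Path.home() / ".kettle"
    shared = kettle_dir / "shared.xml"
    if not shared.exists():
        return None
    backup = shared.with_name("shared.xml.bak")  # hypothetical backup name
    shared.replace(backup)  # overwrites an older backup if one exists
    return backup

if __name__ == "__main__":
    print(reset_shared_objects())
```

Close Spoon before running it, then restart Spoon afterwards.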

Reference URL: http://forums.pentaho.com/showthread.php?93473-Unexpected-problem-reading-shared-objects-from-XML-file-null

---------------- Pushed to Git but nothing was submitted -------- Finally found the cause -- I had not gone into the folder --------------------------------------------

You can first run git status to see the current state of the repository

and check that it lists the files that need to be uploaded (note: you must first go into the folder that is to be uploaded)

Steps to push to Git:

  ① git add -A (stage all files)

  ② git commit -am "message" (commit all the changes; a commit message is required)

  ③ git pull origin master (master is the name of the branch you are on)

  ④ git push origin master (master is the name of the branch you are on)
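
Since the mistake above was running the commands from the wrong folder, one way to avoid it is to script the sequence and pass the repository folder explicitly. A rough sketch in Python using subprocess; the helper names are made up, and it assumes git is installed and the remote is called origin.

```python
import subprocess

def build_git_commands(message, branch="master", remote="origin"):
    """Return the git commands from the steps above, in order."""
    return [
        ["git", "status"],
        ["git", "add", "-A"],
        ["git", "commit", "-am", message],
        ["git", "pull", remote, branch],
        ["git", "push", remote, branch],
    ]

def publish(repo_dir, message, branch="master", remote="origin"):
    """Run every command inside repo_dir, stopping at the first failure."""
    for cmd in build_git_commands(message, branch, remote):
        # cwd=repo_dir makes sure git runs in the folder to be uploaded.
        # check=True raises CalledProcessError on a non-zero exit code,
        # which includes "nothing to commit" from git commit.
        subprocess.run(cmd, cwd=repo_dir, check=True)

if __name__ == "__main__":
    for cmd in build_git_commands("update Spoon notes"):
        print(" ".join(cmd))
```

Calling publish("/path/to/repo", "message") would run the whole sequence; the path here is only a placeholder.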
