@andy2015
2017-08-15T03:40:54.000000Z
字数 7367
阅读 75
TODO
9.30涉及的导数场景:
初始数据:
Oracle -> MySQL(参见930.sql)
Sysmdm.region 区域表
Sysmdm.city 县级城市表
Sysmdm.noise_words 噪词表
Sysmdm.splitword_replace 分词替换表
Sysmdm.splitword_towns 省市县分词表
Sysmdm.splitword_firmkey 固定/等价次
Sysmdm.skipregionlist 相邻省份配置表,用于跨区销售判断
Sysmdm.ovalchannelrule 渠道转换表;名称——》渠道类型
Ovalmdm.saleschannel 销售渠道表
Ovalmdm.orgalias 机构别名表
Ovalmdm.orgmergelist 机构合并记录表
DCMASTER.CATEGORY
DCMASTER.CATEGORYLANG
DCMASTER.PRODUCT
DCMASTER.BUMASTER
DCMASTER.BUMASTERDETAIL
DCMASTER.ORGANIZATION
OVALMDM.OVALMASTER
DCMASTER.ORGCODECOLLATE
OVALMDM.PROJECTORG
Oracle -> Hive(由王耀辉提供)
增量数据:Oracle -> MYSQL (原始流向数据-产品清洗、主数据/对照数据)
DataHub应用界面创建周期任务
DCMASTER.PRODUCT(可能存在更新数据,导入到PRODUCT_TMP表)
DCMASTER.BUMASTER(可能存在更新数据,导入到BUMASTER_TMP表)
DCMASTER.BUMASTERDETAIL(可能存在更新数据,导入到BUMASTERDETAIL_TMP表)
DCMASTER.ORGANIZATION(可能存在更新数据,导入到ORGANIZATION_TMP表)
OVALMDM.OVALMASTER(可能存在更新数据,导入到OVALMASTER_TMP表)
OVALMDM.PROJECTORG(可能存在更新数据,导入到PROJECTORG_TMP表)
OVALMDM.ORIGINALINVENTORYDATA
OVALMDM.ORIGINALSALESDATA
OVALMDM.ORIGINALPURCHASEDATA
DCMASTER.PRODCOLLATE(姜老师提供视图)
OVALMDM.ORGNAMECOLLATE
SYNONYM_NAME | TABLE_OWNER | TABLE_NAME |
---|---|---|
DCMASTER_ORGANIZATION | DCMASTER | ORGANIZATION |
DCMASTER_ORGNAMECOLLATE | DCMASTER | ORGNAMECOLLATE |
DCMASTER_BUMASTER | DCMASTER | BUMASTER |
DCMASTER_BUMASTERDETAIL | DCMASTER | BUMASTERDETAIL |
DCMASTER_PRODUCT | DCMASTER | PRODUCT |
DCMSD_HISTORYOFMONTHSALES | DCMSD | HISTORYOFMONTHSALES |
DCNED_HISTORYOFMONTHSALES | DCNED | HISTORYOFMONTHSALES |
DCBLG_HISTORYOFMONTHSALES | DCBLG | HISTORYOFMONTHSALES |
DCBSL_HISTORYOFMONTHSALES | DCBSL | HISTORYOFMONTHSALES |
DCOTH_HISTORYOFMONTHSALES | DCOTH | HISTORYOFMONTHSALES |
DCJMKX_HISTORYOFMONTHSALES | DCJMKX | HISTORYOFMONTHSALES |
DCCHC_HISTORYOFMONTHSALES | DCCHC | HISTORYOFMONTHSALES |
DCMSD_HISTORYOFDAYSALES | DCMSD | HISTORYOFDAYSALES |
DCNED_HISTORYOFDAYSALES | DCNED | HISTORYOFDAYSALES |
DCBLG_HISTORYOFDAYSALES | DCBLG | HISTORYOFDAYSALES |
DCBSL_HISTORYOFDAYSALES | DCBSL | HISTORYOFDAYSALES |
DCOTH_HISTORYOFDAYSALES | DCOTH | HISTORYOFDAYSALES |
DCJMKX_HISTORYOFDAYSALES | DCJMKX | HISTORYOFDAYSALES |
DCCHC_HISTORYOFDAYSALES | DCCHC | HISTORYOFDAYSALES |
DCMSD_HISTORYOFWEEKSALES | DCMSD | HISTORYOFWEEKSALES |
DCNED_HISTORYOFWEEKSALES | DCNED | HISTORYOFWEEKSALES |
DCBLG_HISTORYOFWEEKSALES | DCBLG | HISTORYOFWEEKSALES |
DCBSL_HISTORYOFWEEKSALES | DCBSL | HISTORYOFWEEKSALES |
DCOTH_HISTORYOFWEEKSALES | DCOTH | HISTORYOFWEEKSALES |
DCJMKX_HISTORYOFWEEKSALES | DCJMKX | HISTORYOFWEEKSALES |
DCCHC_HISTORYOFWEEKSALES | DCCHC | HISTORYOFWEEKSALES |
DCMSD_HISTORYOFMONTHPURCHASE | DCMSD | HISTORYOFMONTHPURCHASE |
DCNED_HISTORYOFMONTHPURCHASE | DCNED | HISTORYOFMONTHPURCHASE |
DCBLG_HISTORYOFMONTHPURCHASE | DCBLG | HISTORYOFMONTHPURCHASE |
DCBSL_HISTORYOFMONTHPURCHASE | DCBSL | HISTORYOFMONTHPURCHASE |
DCOTH_HISTORYOFMONTHPURCHASE | DCOTH | HISTORYOFMONTHPURCHASE |
DCJMKX_HISTORYOFMONTHPURCHASE | DCJMKX | HISTORYOFMONTHPURCHASE |
DCCHC_HISTORYOFMONTHPURCHASE | DCCHC | HISTORYOFMONTHPURCHASE |
DCMSD_HISTORYOFSUBMONTHSALES | DCMSD | HISTORYOFSUBMONTHSALES |
DCNED_HISTORYOFSUBMONTHSALES | DCNED | HISTORYOFSUBMONTHSALES |
DCBLG_HISTORYOFSUBMONTHSALES | DCBLG | HISTORYOFSUBMONTHSALES |
DCBSL_HISTORYOFSUBMONTHSALES | DCBSL | HISTORYOFSUBMONTHSALES |
DCOTH_HISTORYOFSUBMONTHSALES | DCOTH | HISTORYOFSUBMONTHSALES |
DCJMKX_HISTORYOFSUBMONTHSALES | DCJMKX | HISTORYOFSUBMONTHSALES |
DCCHC_HISTORYOFSUBMONTHSALES | DCCHC | HISTORYOFSUBMONTHSALES |
客户清洗
上传清洗数据
AI清洗
流向规则:orgcode为M码,orgid>100 非流向规则:orgcode为M码,orgid<100
上传非流向数据,提供sql语句,select * from table(orgnamecollate,需要看存过确认) where id=上传id 给 datahub,
datahub 保存数据入hive,调我们接口 返回生成hive文件名
调算法接口,提供hive文件名
算法回调处理成功,返回新hive文件名
将新hive数据写进临时表,读取记录,根据id,添加/修改 orgnamecollate ,扩展表/字段,满足detail中, ,ai处理通过,
定时读流向数据,orgnamecollate, 根据datalastupdatetime, orgcode为M码,orgid>100数据 ,提供sql给datahub,后续逻辑同非流向
// AI清洗的结果新建一张表维护返回的结果。(AI会返回多条推荐结果,包括分词、置信度等)
构建任务
1. 读取COLLATECHECKCFG表,新增CHECKCFGEXTEND,记录为:cfgid,field,type,sql记录配置的pvalua1,pvalue2,pvalue3,pvalue4的格式为input/select/date
人工清洗
表COLLATEJOBDETAIL
字段:
COLLATEJOBDETAILID 主键
JOBID 外间关联到表COLLATEJOBCTRL
DTCODE 经销商代码
DTNAME 经销商名称
ORGNAME 客户原始名称
DTREGIONNAME 经销商省份
DTCITYNAME 经销商城市
STATUS 状态 Def=0;0未处理/1待定/2确认/3放弃/4推荐/5二次放弃/6二次推荐
keyword/region_F 经销商地址
ORIDTCODE/ORIDTNAME