FlashExtract: A Framework for Data Extraction by Examples

Posted by da仔
Various document types that combine model and view make it easy to organize (possibly hierarchical) data, but make it difficult to extract raw data for any further manipulation or querying. We present a general framework, FlashExtract, to extract relevant data from semi-structured documents using examples. It includes: (a) an interaction model that allows end-users to give examples to extract various fields and to relate them in a hierarchical organization using structure and sequence constructs, and (b) an inductive synthesis algorithm to synthesize the intended program from few examples in any underlying domain-specific language for data extraction that has been built using our specified algebra of a few core operators (map, filter, merge, and pair).
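To give a feel for how an extraction program might be composed from the four core operators named above, here is a minimal Python sketch. It is an illustration only, not FlashExtract's actual DSL or synthesis algorithm: the function names (`map_op`, `filter_op`, `merge_op`, `pair_op`, `extract_names`) and the toy input format are assumptions made for this example, and the program is written by hand rather than synthesized from examples.

```python
from typing import Callable, Iterable, List, Tuple, TypeVar

A = TypeVar("A")
B = TypeVar("B")


def map_op(f: Callable[[A], B], seq: Iterable[A]) -> List[B]:
    """Apply f to every element of a sequence."""
    return [f(x) for x in seq]


def filter_op(pred: Callable[[A], bool], seq: Iterable[A]) -> List[A]:
    """Keep only the elements that satisfy pred."""
    return [x for x in seq if pred(x)]


def merge_op(*seqs: Iterable[int]) -> List[int]:
    """Union several sequences of positions, kept in document order."""
    return sorted(set().union(*seqs))


def pair_op(start: int, end: int) -> Tuple[int, int]:
    """Combine two positions into a (start, end) region."""
    return (start, end)


def extract_names(lines: List[str]) -> List[str]:
    """Hand-written extraction program for a hypothetical two-format input."""
    idx = range(len(lines))
    # Two filters, one per formatting variant of the "name" field.
    colon_style = filter_op(lambda i: lines[i].startswith("Name:"), idx)
    equals_style = filter_op(lambda i: lines[i].startswith("name ="), idx)
    # Merge the two line sequences back into document order.
    name_lines = merge_op(colon_style, equals_style)

    def value_region(i: int) -> Tuple[int, int]:
        line = lines[i]
        sep = line.index(":") if ":" in line else line.index("=")
        # Pair a start and an end position into the region holding the value.
        return pair_op(sep + 1, len(line))

    # Map each selected line to the text inside its value region.
    regions = map_op(lambda i: (i, value_region(i)), name_lines)
    return map_op(lambda r: lines[r[0]][r[1][0]:r[1][1]].strip(), regions)


if __name__ == "__main__":
    sample = ["Name: Ada", "Age: 36", "name = Alan", "Age: 41"]
    print(extract_names(sample))
```

In this sketch, filter selects the lines of interest, merge recombines the lines found by different filters, pair builds a region from two positions, and map turns each region into an extracted value. A synthesizer in the spirit of the framework would search for such a composition automatically from a few user-provided examples.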