AlphaGenerator/manual_prompt/2025/manual_prompt_2025122910265...

Task instructions
[Strategy type] Factor optimization
-ts_corr(ts_backfill(fscore_momentum,66),ts_backfill(fscore_value,66),756)
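Valuation-deviation volatility short strategy
[[Hypothesis]]
Value and momentum factors are usually uncorrelated across stocks. If a stock's momentum score and value score are highly correlated, its price has likely deviated from its intrinsic value, so we take a short position in such stocks.
[[Implementation]]
Use ts_corr() to measure the correlation between the value score and the momentum score over the past three years (756 trading days), and use the ts_backfill() operator to backfill earlier missing data.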
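The implementation above can be sketched outside WebSim in plain Python. This is only an illustration, under the assumption that ts_backfill carries the latest non-missing value forward within the lookback and that ts_corr is a trailing Pearson correlation; the helper names `backfill`, `rolling_corr`, and `short_signal`, and the sample scores, are hypothetical.

```python
import math

def backfill(series, lookback):
    """Fill each None with the latest non-None value among the previous `lookback` entries."""
    filled = []
    for i, value in enumerate(series):
        if value is None:
            # Walk backwards through the original series, at most `lookback` steps.
            for j in range(i - 1, max(i - lookback, 0) - 1, -1):
                if series[j] is not None:
                    value = series[j]
                    break
        filled.append(value)
    return filled

def rolling_corr(x, y, window):
    """Pearson correlation of x and y over a trailing window; None when undefined."""
    result = []
    for i in range(len(x)):
        start = max(i + 1 - window, 0)
        xs, ys = x[start:i + 1], y[start:i + 1]
        if len(xs) < window or any(v is None for v in xs + ys):
            result.append(None)
            continue
        mean_x = sum(xs) / window
        mean_y = sum(ys) / window
        cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(xs, ys))
        var_x = sum((a - mean_x) ** 2 for a in xs)
        var_y = sum((b - mean_y) ** 2 for b in ys)
        if var_x == 0 or var_y == 0:
            result.append(None)  # correlation is undefined for a constant series
        else:
            result.append(cov / math.sqrt(var_x * var_y))
    return result

def short_signal(momentum, value, lookback, window):
    """Negated rolling correlation of the two backfilled score series."""
    corr = rolling_corr(backfill(momentum, lookback), backfill(value, lookback), window)
    return [None if c is None else -c for c in corr]

# Hypothetical score history for one stock, with one gap in each series.
momentum = [1.0, 2.0, None, 4.0]
value = [2.0, None, 6.0, 8.0]
signal = short_signal(momentum, value, lookback=2, window=3)
```

The negation mirrors the leading minus sign in the expression: the more strongly the two scores move together, the more negative the signal, and hence the larger the short position.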
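[[Alpha factor improvement suggestions]]
Use group operators and model data to compare stocks cross-sectionally within groups of similar stocks.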
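As a rough illustration of what a within-group comparison means, the sketch below ranks each stock only against members of its own group. It assumes ranks are scaled to the range 0 to 1 and does nothing special about ties; the tickers, group labels, and the helper name `group_rank_sketch` are hypothetical and are not the platform's implementation.

```python
from collections import defaultdict

def group_rank_sketch(values, groups):
    """Rank each instrument within its group, scaled to [0, 1]."""
    members = defaultdict(list)
    for name, group in groups.items():
        if values.get(name) is not None:
            members[group].append(name)
    ranks = {}
    for names in members.values():
        ordered = sorted(names, key=lambda n: values[n])
        count = len(ordered)
        for position, name in enumerate(ordered):
            # A single-member group has no peers, so place it in the middle.
            ranks[name] = position / (count - 1) if count > 1 else 0.5
    return ranks

# Hypothetical raw signal values and group labels.
values = {"A": -0.2, "B": 0.5, "C": 0.1, "D": 0.9, "E": 0.3}
groups = {"A": "tech", "B": "tech", "C": "tech", "D": "energy", "E": "energy"}
ranks = group_rank_sketch(values, groups)
```

Here "E" gets rank 0.0 even though its raw value exceeds that of "C", because each stock is compared only with peers in its own group.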
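[Data characteristics]
Indicator data is stored in an EAV (entity-attribute-value) structure:
- name field: stores the indicator name
- value field: stores the actual value
- Relevant indicators must be found by fuzzy-matching the name field
- Field names must be in English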
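The lookup described above can be pictured as a case-insensitive substring search over the name field. The sketch below is a minimal illustration, assuming rows are plain dictionaries with name and value keys; the row values and the helper name `fuzzy_find` are hypothetical.

```python
def fuzzy_find(rows, keywords):
    """Return rows whose name contains any keyword, ignoring case."""
    lowered = [keyword.lower() for keyword in keywords]
    return [row for row in rows if any(k in row["name"].lower() for k in lowered)]

# Hypothetical EAV rows: one indicator per row, identified by its name.
rows = [
    {"name": "fscore_momentum", "value": 0.42},
    {"name": "fscore_value", "value": 0.17},
    {"name": "news_max_up_amt", "value": 1.3},
]

matches = fuzzy_find(rows, ["momentum", "value"])
matched_names = [row["name"] for row in matches]
```

Short root keywords such as "momentum" match every field that embeds them, which is why compound keywords are unnecessary.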
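[Output requirements]
1. Format: a plain Python list and nothing else
2. Content: fuzzy-match keywords for field names. Output basic word roots; do not use compound forms like 111_222_333. Give a single keyword each, since I will run a fuzzy search on it
3. Scope: cover the core indicators and related supporting indicators
4. Prohibited: do not write full strategy code, do not write SQL, do not add comments, do not write examples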
*=========================================================================================*
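Output format:
The output must be plain text and nothing else.
Each line is one complete, standalone, syntactically correct WebSim expression.
No explanations, numbering, wrapping punctuation (such as quotation marks), Markdown formatting, or extra text of any kind.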
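===================== !!! Key point (output format) !!! =====================
Now, strictly following all of the rules above, generate composite factor expressions that can run in WebSim immediately.
**Output format** (one expression per line, with a blank line between consecutive expressions; output only the expressions themselves, with no explanations, no numbering, and nothing extra):
expression
expression
expression
...
expression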
=================================================================
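To reiterate: make sure every expression uses only WorldQuant WebSim platform functions; do not use pandas, numpy, or other Python library functions. Each output must be a single valid WQ expression on one line.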
Below are the operators my account is permitted to use. Strictly follow these operators and the dataset I provide to generate and combine 10 alphas:

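========================= Operators start =======================================
Note: the text after "Operator:" is the operator itself.
The text after "Description:" is the operator's description or usage notes; it is not an operator.
Important: alphas must be generated according to the usage shown in each Operator field.
Operator: abs(x)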
Description: Absolute value of x
Operator: add(x, y, filter = false)
Description: Add all inputs (at least 2 inputs required). If filter = true, filter all input NaN to 0 before adding
Operator: densify(x)
Description: Converts a grouping field of many buckets into lesser number of only available buckets so as to make working with grouping fields computationally efficient
Operator: divide(x, y)
Description: x / y
Operator: inverse(x)
Description: 1 / x
Operator: log(x)
Description: Natural logarithm. For example: Log(high/low) uses natural logarithm of high/low ratio as stock weights.
Operator: max(x, y, ..)
Description: Maximum value of all inputs. At least 2 inputs are required
Operator: min(x, y ..)
Description: Minimum value of all inputs. At least 2 inputs are required
Operator: multiply(x ,y, ... , filter=false)
Description: Multiply all inputs. At least 2 inputs are required. Filter sets the NaN values to 1
Operator: power(x, y)
Description: x ^ y
Operator: reverse(x)
Description: - x
Operator: sign(x)
Description: if input > 0, return 1; if input < 0, return -1; if input = 0, return 0; if input = NaN, return NaN;
Operator: signed_power(x, y)
Description: x raised to the power of y such that final result preserves sign of x
Operator: sqrt(x)
Description: Square root of x
Operator: subtract(x, y, filter=false)
Description: x-y. If filter = true, filter all input NaN to 0 before subtracting
Operator: and(input1, input2)
Description: Logical AND operator, returns true if both operands are true and returns false otherwise
Operator: if_else(input1, input2, input 3)
Description: If input1 is true then return input2 else return input3.
Operator: input1 < input2
Description: If input1 < input2 return true, else return false
Operator: input1 <= input2
Description: Returns true if input1 <= input2, return false otherwise
Operator: input1 == input2
Description: Returns true if both inputs are same and returns false otherwise
Operator: input1 > input2
Description: Returns true if input1 > input2, returns false otherwise
Operator: input1 >= input2
Description: Returns true if input1 >= input2, return false otherwise
Operator: input1!= input2
Description: Returns true if both inputs are NOT the same and returns false otherwise
Operator: is_nan(input)
Description: If (input == NaN) return 1 else return 0
Operator: not(x)
Description: Returns the logical negation of x. If x is true (1), it returns false (0), and if input is false (0), it returns true (1).
Operator: or(input1, input2)
Description: Logical OR operator returns true if either or both inputs are true and returns false otherwise
Operator: days_from_last_change(x)
Description: Number of days since the last change of x
Operator: hump(x, hump = 0.01)
Description: Limits amount and magnitude of changes in input (thus reducing turnover)
Operator: kth_element(x, d, k)
Description: Returns K-th value of input by looking through lookback days. This operator can be used to backfill missing data if k=1
Operator: last_diff_value(x, d)
Description: Returns last x value not equal to current x value from last d days
Operator: ts_arg_max(x, d)
Description: Returns the relative index of the max value in the time series for the past d days. If the current day has the max value for the past d days, it returns 0. If previous day has the max value for the past d days, it returns 1
Operator: ts_arg_min(x, d)
Description: Returns the relative index of the min value in the time series for the past d days; If the current day has the min value for the past d days, it returns 0; If previous day has the min value for the past d days, it returns 1.
Operator: ts_av_diff(x, d)
Description: Returns x - tsmean(x, d), but deals with NaNs carefully. That is NaNs are ignored during mean computation
Operator: ts_backfill(x,lookback = d, k=1, ignore="NAN")
Description: Backfill is the process of replacing the NAN or 0 values by a meaningful value (i.e., a first non-NaN value)
Operator: ts_corr(x, y, d)
Description: Returns correlation of x and y for the past d days
Operator: ts_count_nans(x ,d)
Description: Returns the number of NaN values in x for the past d days
Operator: ts_covariance(y, x, d)
Description: Returns covariance of y and x for the past d days
Operator: ts_decay_linear(x, d, dense = false)
Description: Returns the linear decay on x for the past d days. Dense parameter=false means operator works in sparse mode and we treat NaN as 0. In dense mode we do not.
Operator: ts_delay(x, d)
Description: Returns x value d days ago
Operator: ts_delta(x, d)
Description: Returns x - ts_delay(x, d)
Operator: ts_mean(x, d)
Description: Returns average value of x for the past d days.
Operator: ts_product(x, d)
Description: Returns product of x for the past d days
Operator: ts_quantile(x,d, driver="gaussian" )
Description: It calculates ts_rank and apply to its value an inverse cumulative density function from driver distribution. Possible values of driver (optional ) are "gaussian", "uniform", "cauchy" distribution where "gaussian" is the default.
Operator: ts_rank(x, d, constant = 0)
Description: Rank the values of x for each instrument over the past d days, then return the rank of the current value + constant. If not specified, by default, constant = 0.
Operator: ts_regression(y, x, d, lag = 0, rettype = 0)
Description: Returns various parameters related to regression function
Operator: ts_scale(x, d, constant = 0)
Description: Returns (x - ts_min(x, d)) / (ts_max(x, d) - ts_min(x, d)) + constant. This operator is similar to scale down operator but acts in time series space
Operator: ts_std_dev(x, d)
Description: Returns standard deviation of x for the past d days
Operator: ts_step(1)
Description: Returns days' counter
Operator: ts_sum(x, d)
Description: Sum values of x for the past d days.
Operator: ts_zscore(x, d)
Description: Z-score is a numerical measurement that describes a value's relationship to the mean of a group of values. Z-score is measured in terms of standard deviations from the mean: (x - tsmean(x,d)) / tsstddev(x,d). This operator may help reduce outliers and drawdown.
Operator: normalize(x, useStd = false, limit = 0.0)
Description: Calculates the mean value of all valid alpha values for a certain date, then subtracts that mean from each element
Operator: quantile(x, driver = gaussian, sigma = 1.0)
Description: Rank the raw vector, shift the ranked Alpha vector, apply distribution (gaussian, cauchy, uniform). If driver is uniform, it simply subtract each Alpha value with the mean of all Alpha values in the Alpha vector
Operator: rank(x, rate=2)
Description: Ranks the input among all the instruments and returns an equally distributed number between 0.0 and 1.0. For precise sort, use the rate as 0
Operator: scale(x, scale=1, longscale=1, shortscale=1)
Description: Scales input to booksize. We can also scale the long positions and short positions to separate scales by mentioning additional parameters to the operator
Operator: winsorize(x, std=4)
Description: Winsorizes x to make sure that all values in x are between the lower and upper limits, which are specified as multiple of std.
Operator: zscore(x)
Description: Z-score is a numerical measurement that describes a value's relationship to the mean of a group of values. Z-score is measured in terms of standard deviations from the mean
Operator: vec_avg(x)
Description: Taking mean of the vector field x
Operator: vec_sum(x)
Description: Sum of vector field x
Operator: bucket(rank(x), range="0, 1, 0.1" or buckets = "2,5,6,7,10")
Description: Convert float values into indexes for user-specified buckets. Bucket is useful for creating group values, which can be passed to GROUP as input
Operator: trade_when(x, y, z)
Description: Used in order to change Alpha values only under a specified condition and to hold Alpha values in other cases. It also allows to close Alpha positions (assign NaN values) under a specified condition
Operator: group_backfill(x, group, d, std = 4.0)
Description: If a certain value for a certain date and instrument is NaN, from the set of same group instruments, calculate winsorized mean of all non-NaN values over last d days
Operator: group_mean(x, weight, group)
Description: All elements in a group are set to the group mean
Operator: group_neutralize(x, group)
Description: Neutralizes Alpha against groups. These groups can be subindustry, industry, sector, country or a constant
Operator: group_rank(x, group)
Description: Each element in a group is assigned its corresponding rank within that group
Operator: group_scale(x, group)
Description: Normalizes the values in a group to be between 0 and 1. (x - groupmin) / (groupmax - groupmin)
Operator: group_zscore(x, group)
Description: Calculates group Z-score - numerical measurement that describes a value's relationship to the mean of a group of values. Z-score is measured in terms of standard deviations from the mean. zscore = (data - mean) / stddev of x for each instrument within its group.
========================= Operators end =======================================
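To make the time-series descriptions in the operator list concrete, here is a small plain-Python sketch of two of them for a single instrument, following the formulas in their Description fields. It assumes the d-day window includes the current day and that an incomplete window yields a missing value; both are assumptions rather than documented platform behaviour, and the `_sketch` helper names and sample prices are hypothetical.

```python
def ts_delta_sketch(series, d):
    """x - ts_delay(x, d): change versus the value d days earlier."""
    return [series[i] - series[i - d] if i >= d else None for i in range(len(series))]

def ts_scale_sketch(series, d, constant=0):
    """(x - ts_min(x, d)) / (ts_max(x, d) - ts_min(x, d)) + constant."""
    scaled = []
    for i in range(len(series)):
        if i + 1 < d:
            scaled.append(None)  # not enough history for a full window
            continue
        window = series[i + 1 - d:i + 1]
        low, high = min(window), max(window)
        if high == low:
            scaled.append(None)  # flat window: the ratio is undefined
        else:
            scaled.append((series[i] - low) / (high - low) + constant)
    return scaled

# Hypothetical daily values for one instrument.
prices = [10.0, 12.0, 11.0, 15.0]
deltas = ts_delta_sketch(prices, 1)
scaled = ts_scale_sketch(prices, 3)
```

On the last day the value 15.0 is the maximum of its three-day window, so the scaled result is 1.0; the day before, 11.0 sits halfway between the window minimum 10.0 and maximum 12.0.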

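========================= Data fields start =======================================
Note: the text after data_set_name: is the data field (may be used); the text after description: is that field's description or usage notes (must not be used); the text after description_cn is the Chinese description (must not be used)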

{'id': '240', 'data_set_name': 'fnd6_eventv110_gdwlieps12', 'description': 'Impairment of Goodwill Basic EPS Effect 12MM', 'description_cn': 'goodwill_impairment_basic_eps_effect_12m'}
{'id': '241', 'data_set_name': 'fnd6_eventv110_gdwliepsq', 'description': 'Impairment of Goodwill Basic EPS Effect', 'description_cn': 'goodwill impairment basic_eps_effect'}
{'id': '624', 'data_set_name': 'fnd6_newqeventv110_spcedq', 'description': 'S&P Core Earnings EPS Diluted', 'description_cn': 'SPCE earnings per share diluted'}
{'id': '691', 'data_set_name': 'fnd6_newqv1300_cipenq', 'description': 'Comp Inc - Minimum Pension Adj', 'description_cn': '-comp_inc_min_pension_adj'}
{'id': '989', 'data_set_name': 'fscore_bfl_total', 'description': 'The final score M-Score is a weighted average of both the Pentagon surface score and the Pentagon acceleration score.', 'description_cn': '最终得分M-_score=五角大楼表面评分与五角加速评分加权平均值'}
{'id': '997', 'data_set_name': 'fscore_total', 'description': 'The final score M-Score is a weighted average of both the Pentagon surface score and the Pentagon acceleration score.', 'description_cn': '最终得分M-Score是pentagon表面分数和pentagon加速度分数的加权平均值。'}
{'id': '1292', 'data_set_name': 'anl4_netdebt_flag', 'description': 'Net debt - forecast type (revision/new/...)', 'description_cn': '净负债-预测类型(修正值/新值/...)'}
{'id': '1927', 'data_set_name': 'news_max_up_amt', 'description': 'The after the news high minus the price at the time of the news', 'description_cn': '新闻后 highs 减去 新闻时 price 的差值'}
{'id': '1931', 'data_set_name': 'news_mins_10_pct_up', 'description': 'Number of minutes that elapsed before price went up 10 percentage points', 'description_cn': '价格上升10个百分点前elapsed分钟数'}
{'id': '1933', 'data_set_name': 'news_mins_1_pct_dn', 'description': 'Number of minutes that elapsed before price went down 1 percentage point', 'description_cn': '价格下跌1个百分点前elapsed分钟数'}
{'id': '1934', 'data_set_name': 'news_mins_1_pct_up', 'description': 'Number of minutes that elapsed before price went up 1 percentage point', 'description_cn': '价格上升1个百分点前elapsed的分钟数'}
{'id': '1942', 'data_set_name': 'news_mins_3_pct_dn', 'description': 'Number of minutes that elapsed before price went down 3 percentage points', 'description_cn': '价格下跌3个百分点前 elapsed_分钟数'}
{'id': '1945', 'data_set_name': 'news_mins_4_pct_dn', 'description': 'Number of minutes that elapsed before price went down 4 percentage points', 'description_cn': '价格下跌4个百分点前elapsed分钟数'}
{'id': '1946', 'data_set_name': 'news_mins_4_pct_up', 'description': 'Number of minutes that elapsed before price went up 4 percentage points', 'description_cn': '价格上漲4個百分點前 elapsed 分钟数'}
{'id': '1985', 'data_set_name': 'nws12_afterhsz_01s', 'description': 'Number of minutes that elapsed before price went down 10 percentage points', 'description_cn': '价格下跌至低于初始水平10个百分点前elapsed分钟数'}
{'id': '1992', 'data_set_name': 'nws12_afterhsz_1l', 'description': 'Number of minutes that elapsed before price went up 1 percentage points', 'description_cn': '涨价至1个百分点前elapsed分钟数'}
{'id': '1999', 'data_set_name': 'nws12_afterhsz_3l', 'description': 'Number of minutes that elapsed before price went up 3 percentage points', 'description_cn': '价格上涨3个百分点前elapsed的分钟数'}
{'id': '2001', 'data_set_name': 'nws12_afterhsz_3s', 'description': 'Number of minutes that elapsed before price went down 3 percentage points', 'description_cn': '价格下跌3个百分点前elapsed分钟数'}
{'id': '2003', 'data_set_name': 'nws12_afterhsz_4l', 'description': 'Number of minutes that elapsed before price went up 4 percentage points', 'description_cn': '价格上升4个百分点前elapsed分钟数'}
{'id': '2089', 'data_set_name': 'nws12_mainz_4l', 'description': 'Number of minutes that elapsed before price went up 4 percentage points', 'description_cn': '价格上涨4个百分点前elapsed分钟数'}
{'id': '2154', 'data_set_name': 'nws12_prez_02s', 'description': 'Number of minutes that elapsed before price went down 20 percentage points', 'description_cn': '价格下跌20个百分点前elapsed分钟数'}
{'id': '2168', 'data_set_name': 'nws12_prez_3s', 'description': 'Number of minutes that elapsed before price went down 3 percentage points', 'description_cn': '价格下跌3个百分点前elapsed分钟数'}
{'id': '2172', 'data_set_name': 'nws12_prez_4s', 'description': 'Number of minutes that elapsed before price went down 4 percentage points', 'description_cn': '价格下跌4个百分点前elapsed分钟数'}
{'id': '2173', 'data_set_name': 'nws12_prez_57l', 'description': 'Number of minutes that elapsed before price went up 7.5 percentage points', 'description_cn': '价格上升7.5个百分点前elapsed分钟数'}
{'id': '2179', 'data_set_name': 'nws12_prez_5s', 'description': 'Number of minutes that elapsed before price went down 5 percentage points', 'description_cn': '价格下跌5个百分点前elapsed分钟数'}
========================= Data fields end =======================================